LIE ALGEBRAS 


1. BACKGROUND 


In this section I use some material, like multivariable analysis, which is not necessary for the main body of the 
course, but if you know it (and hopefully if you don’t but are willing to think imprecisely at some points) it will 
help to put the course in context. For those worried about such things, fear not, it is non-examinable. 


In mathematics, group actions give a way of encoding the symmetries of a space or physical system. 
Formally these are defined as follows: an action of a group G ona space’ X isa mapa: Gx X > X, 
written (g.c) + a(g, x) or more commonly (g, x) ++ g.x which satisfies the properties 

(1) e.x = z, for all x € X, where e € G is the identity; 

(2) (g1g2).@ = g1-(g2.x) for all g1, g2 E Ganda € X. 
Natural examples of actions are that of the group of rigid motions SO; on the unit sphere {x € R? : 
||z|| = 1}, or the general linear group GL, (R) on R”. 

Whenever a group acts on a space X, there is a resulting linear action (a representation) on the vector 
space of functions on X. Indeed if Fun(X) denotes the vector space of real-valued functions on X, then 
the formula 

g(f)(@)=f(g*-2), (g €G, f € Fun(X),2€ X), 
defines a representation of G on Fun(X). If X and G have more structure. e.g. that of a topological 
space or smooth manifold, then this action may also preserve the subspaces of say continuous, or 
differentiable functions. Lie algebras arise as the “infinitesimal version” of group actions, which loosely 
speaking means they are what we get by trying to differentiate group actions. 


Example 1.1. Take for example the natural action of the circle St by rotations on the plane R?. This 
action can be written explicitly using matrices: 


_ f cos(t) —sin(t) 
g(t) = ( sin(t)  cos(t) 
where we have smoothly parametrized the circle S' using the trigonometric functions. Note that for 
this parametrization, g(t)~' = g(—t). The induced action on Fun(R?) restricts to an action on C% (R?) 
the space of smooth (i.e. infinitely differentiable) functions on R°. Using our parametrization, it makes 


sense to differentiate this action at the identity element (i.e. at t = 0) to get an operation v: C% (R?) > 
C~(R?), given by 


It is immediate from the product rule for differentiation, that the operator v constructed in the above 

example obeys the “Leibniz rule”: 
V(f.g) =U(f).g + f(g). 

An operator on smooth functions which satisfies this property is called a derivation. It’s not hard to 
see that any such operator on C® (R?) will be of the form a(x, y) + + b(a, DEn where a,b € C®(R?). 
Thus, heuristically for now, we think of the infinitesimal version of a group action as the collection of 
derivations on smooth functions we obtain by “differentiating the group action at the identity element”. 
(For the circle the collection of vector fields we get are just the scalar multiples of the vector field v, but 
for actions of larger group we would attach a larger space of derivations). It turns out this set of 
derivations forms a vector space, but it also has a kind of “product” which is a sort of infinitesimal 
remnant of the group multiplication’. Let’s set this up a little more formally. 


Date: November, 2012. 
tm being deliberately vague here about what a “space” is, X could just be a set, but it could also have a more geometric 
nature, such as a topological space or submanifold of R”. 
2To be a bit more precise, it comes from the conjugation action of the group on itself. 
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Definition 1.2. A vector field on X = R” (or, with a bit more work, any manifold) is a (smooth) function 
v: R” — R”, which one can think of as giving the infinitesimal direction of a flow (e.g. of a fluid, 
or an electric field say). The set of vector fields forms a vector space which we denote by Ox. Such 
fields can be made to act on functions f: X — R by differentiation. If v = (a), a2,...,a@,,) in standard 
coordinates, then set 


i=1 
This formula gives an action of Ox on the space of smooth functions C% (X), since if f € C°(X), then 
so is v(f). This action is linear, and interacts with multiplication of functions according to the Leibniz 
rule, that is, if v is a vector field, and fı, fo € C (X) then 


v( fife) =v(fi)-f2 + fru (fe), 


in other words, vector fields act as derivations on smooth functions in the above sense. Note we can 
talk about vector fields and derivations interchangeably, since the derivation given by a vector field 
completely determines the vector field, and any derivation comes from a vector field (this is an exercise 
that is worth checking). 


Note that if we compose two derivations vı o v2 we again get an operator on functions, but it is 
not given by a vector field, since it involves second order differential operators. However, it is easy 
to check using the symmetry of mixed partial derivatives that if vı, v2 are derivations, then [11,172] = 
vı © V2 — V2 o n is again a derivation. Thus the space Ox of vector fields on X is equipped with a 
natural product? [., .] which is called a Lie bracket. The derivatives of a group action give subalgebras of 
the algebra Ox. 


Example 1.3. Consider the action of SO3(R) on R*. Using the fact that every element of SO3(R) is a 
rotation about some axis through the origin it is not too hard to find the space of vector fields on R? 
which can be associated to this action, and check that it forms a Lie algebra. Indeed as we saw before, 
the action of the circle fixing the z-axis gives the derivation —y + se, and the derivation obtained 
from rotation about any other axis will be obtained by an orthogonal change of coordinates. It can 


be shown that these form the 3-dimensional space g = span, {x iy y, yg -z Hz 4 — xt}, and 
moreover it is then not hard to check that g is closed under the bracket operations |[-, -]. (This also gives 


a non-trivial example of a 3-dimensional Lie algebra). 


2. DEFINITIONS AND EXAMPLES 


The definition of a Lie algebra is an abstraction of the above example of the product on vector fields. 
It is purely algebraic, so it makes sense over any field k. 


Definition 2.1. A Lie algebra over a field k is a pair (g, |., .],) consisting of a k-vector space g, along with 
a bilinear form |., .],: g x g — g taking values in g known as a Lie bracket, which satisfies the following 
axioms: 

(1) [.,.], is alternating, i.e. |x, a], = 0 forall z € g. 

(2) The Lie bracket satisfies the Jacobi Identity: that is, for all x,y,z € g we have: 


[x, [y, Zlala + ly, [z,Z]a]g + [z, [z, Ylalg =0. 


Remark 2.2. It is easy to check directly from the definition that the Lie bracket we put on the space of 
vector fields O y satisfies the above conditions. 

Note that by considering the bracket |x + y,x + y]g it is easy to see that the alternating condition 
implies that for all x,y € L we have |z, y], = —[y, 2g, that is [.,.], is skew-symmetric. If char(k) # 2, 
the alternating condition is equivalent to skew-symmetry. Note that a Lie algebra is an algebra with 
a product which is neither commutative nor associative, and moreover it does not have an identity 
element*. We will normally simply write |., .] and reserve use the decorated bracket only for emphasis 
or where there is the potential for confusion. 


Example 2.3. (1) If V is any vector space then setting the Lie bracket [.,.] to be zero we get a (not 
very interesting) Lie algebra. Such Lie algebras are called abelian Lie algebras. 


3This is in the weakest sense, in that it is a bilinear map Ox x Ox — Ox. It is not even associative — the axiom it does satisfy 
is discussed shortly. 
4This makes them sound awful. However, as we will see this is not the way to think about them! 
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(2) Generalising the example of vector fields a bit, if A is a k-algebra and 6: A — A is a k-linear 
map, then we say ô is a k-derivation if it satisfies the Leibniz rule, that is, if: 


ô(a.b) = d(a).b+a.6(b), Va,b EA. 


It is easy to see by a direct calculation that if Der(A) denotes the k-vector space of k-derivations 
on A, then Der(A) is a Lie algebra under the commutator product, that is: 


(51, da] = ôi O 69 oa bg O 64. 


Indeed the alternating property is immediate, so the only thing to check is the Jacobi identity, 
which is an easy computation. 

(3) For a more down-to-earth example, take g = gl,, the k-vector space of n x n matrices with 
entries in k. It is easy to check that this is a Lie algebra for the commutator product: 


[X,Y] = X.Y -Y.X. 


Slightly more abstractly, if V is a vector space, then we will write gl(V) for the Lie algebra 

End(V) equipped with the commutator product as for matrices. 

If g is a Lie algebra and N < g is a k-subspace of g on which the restriction of the Lie bracket 

takes values in N, so that it induces a bilinear form [.,.]v: Nx N —> N, then (N, [., |v) is clearly 

a Lie algebra, and we say N is a (Lie) subalgebra of g. 

(5) Let sl, = {X € gl, : tr(X) = 0} be the space of n x n matrices with trace zero. It is easy to check 
that sl, is a Lie subalgebra of gl,, (even though it is not a subalgebra of the associative algebra 
End(V)). More generally we say any Lie subalgebra of gl(V) for a vector space V is a linear Lie 
algebra. 

(6) If A is an associative k-algebra, then if a € A let ôa: A — Abe the linear map given by 


a(b) =a.b—b.a, bEA. 
One can check that 6, is a derivation on A, and that this is equivalent to the statement that 
(A, [.,.]4) is a Lie algebra, where |., .| 4 is the commutator bracket on A, that is [a, b]4 = a.b— b.a. 


Thus any associative algebra can be given the structure of a Lie algebra. (This is a generalisation 
of the case of n x n matrices). 


(4 
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Remark 2.4. One could begin to try and classify all (say finite-dimensional) Lie algebras. In very low 
dimension this is actually possible. For dimension 1 clearly there is a unique (up to isomorphism”) 
Lie algebra since the alternating condition demands that the bracket is zero. In dimension two, one 
can again have an abelian Lie algebra, but there is another possibility: if g has a basis {e, f} then we 
may set |e, f] = f, and this completely determines the Lie algebra structure. All two-dimensional 
Lie algebras which are not abelian are isomorphic to this one (check this). It is also possible to classify 
three-dimensional Lie algebras, but it becomes rapidly intractable to do this in general as the dimension 
increases. In this course we will focus on a particular class of Lie algebras, known as semisimple Lie 
algebras, for which an elegant classification theorem is known. 


3. HOMOMORPHISMS AND IDEALS 


We have already introduced the notion of a subalgebra of a Lie algebra in the examples above, 
but there are other standard constructions familiar from rings which make sense for Lie algebras. A 
homomorphism of Lie algebras (g, |., .],) and (g’, [.,.],’) is a k-linear map ¢: g — g’ which respects the 
Lie brackets, that is: 

o([a, bla) = [ġ(a), (bla Va, b € g. 
An ideal in a Lie algebra g is a subspace J such that for all a € g and x € I we have [a,x], € J. It 
is easy to check that if 6: g —> g’ is a homomorphism, then ker(¢) = {a € g : ¢(a) = 0} is an ideal 
of g. Conversely, if I is an ideal of g then it is easy to check that the quotient space g/J inherits the 
structure of a Lie algebra, and the canonical quotient map q: g > g/J is a Lie algebra homomorphism 
with kernel J. 


Remark 3.1. Note that because the Lie bracket is skew-symmetric, we do not need to consider notions 
of left, right and two-sided ideals, as they will all coincide. If a nontrivial Lie algebra has no nontrivial 
ideals we say it is simple. 


Just as for groups and rings, we have the normal stable of isomorphism theorems, and the proofs 
are identical. 


5Of course I haven’t said what an isomorphism of Lie algebras is yet (see below) but you probably know... 
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Theorem 3.2. (1) Let 6: g — g' be a homomorphism of Lie algebras. The subspace ¢(g) = im(@) is a 
subalgebra of g' and ¢ induces an isomorphism @: g/ker(p) > im(¢). 
(2) IfI c J C gare ideals of g then we have: 


(9/J)/(I/J) = g/J 
(3) If I, J are ideals of g then we have 
(I+ J)/J2I/iIn J). 


4. REPRESENTATIONS OF LIE ALGEBRAS 


Just as for finite groups (or indeed groups in general) one way of studying Lie algebras is to try and 
understand how they can act on other objects. For Lie algebras, we will use actions on linear spaces, or 
in other words, “representations”. Formally we make the following definition. 


Definition 4.1. A representation of a Lie algebra g is a vector space V equipped with a homomorphism 
of Lie algebras p: g > gl(V). In other words, p is a linear map such that 


p([x, y]) = p(x) © p(y) — ply) © plx) 


where o denotes composition of linear maps. We may also refer to a representation of g as a g-module. 
A representation is faithful if ker(g) = 0. When there is no danger of confusion we will normally 
suppress p in our notation, and write x(v) rather than p(x)(v), for x € g,v E€ V. 


We will study representation of various classes of Lie algebras in this course, but for the moment we 
will just give some basic examples. 


Example 4.2. (1) If g = gl(V) for V a vector space, then the identity map gl(V) — glI(V) is a rep- 
resentation of gl(V) on V, which is known as the vector representation. Clearly any subalgebra 
g of gl(V) also inherits V as a representation, where then the map p is just the inclusion map. 
(2) Given an arbitrary Lie algebra g, there is a natural representation ad of g on g itself known as 
the adjoint representation. The homomorphism from g to gl(g) is given by 


ad(x)(y) = [x,y], Vz,y €g. 


Indeed the fact that this is a representation is just a rephrasing’ of the Jacobi identity. Note that 
while the vector representation is clearly faithful, in general the adjoint representation is not. 
Indeed the kernel is known as the centre of g: 


alg) = {x € g : [x,y] = 0, Vy € g}. 


Note that if x € 3(g) then for any representation p: g > gl(V) the endomorphism p(x) com- 
mutes with all the elements p(y) € End(V) for all y € g. 

(3) If g is any Lie algebra, then the zero map g — gl, is a Lie algebra homomorphism. The corres- 
ponding representation is called the trivial representation. It is the Lie algebra analogue of the 
trivial representation for a group (which send every group element to the identity). 

(4) If (V, p) is a representation of g, we say that a subspace U < V is a subrepresentation if ¢(x)(U) C 
U for all x € g. It follows immediately that ¢ restricts to give a homomorphism from g to gl(U), 
hence (U, ¢\y7) is again a representation of g. Note also that if {V; : i € I} are a collection of in- 

variant subspaces, their sum )},., Vi is clearly also invariant, and so again a subrepresentation. 

There are a number of standard ways of constructing new representations from old, all of which 

have their analogue for group representations. For example, recall that if V is a k-vector space, and U 

is a subspace, then we may form the quotient vector space V/U. If 6: V + V is an endomorphism of V 

which preserves U, that is if ¢(U) C U, then there is an induced map ¢: V/U — V/U. Applying this to 

representations of a Lie algebra g we see that if V is a representation of g and U is a subrepresentation 
we may always form the quotient representation V/U. Next, if (V, p) and (W,o) are representations of 

g, then clearly V © W the direct sum of V and W becomes a g-representation via the obvious homo- 

morphism p © ø. More interestingly, the vector space Hom(V, W) of linear maps from V to W has the 

structure of a g-representation via 


(4.1) xt(d) =o(4)od—Gop(x), Yx Eg, € Hom(V, W). 


®Check this! It’s also (for some people) a useful way of remembering what the Jacobi identity says. 
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It is straight-forward to check that this gives a Lie algebra homomorphism from g to gl(Hom(V, W)). 
One way to do this is simply to compute directly”. Another, slightly quicker way, is to notice that 
End. (V $ W) = Hom, (V @ W, V @ W) contains End, (V), End,(W) and Hom, (V, W) since 


Hom, (V $ W, V @W) = Hom, (V, V) 6 Hom, (V, W) 6 Hom, (W, V) $ Hom, (W, W), 


thus we may combine p and a to give a homomorphism 7: g > gl(V pW). Since Hom, (V, W) is clearly 
preserved by 7(g), it is a subrepresentation of gl(V © W), where the latter is a g-representation via the 
composition of T: g > gl(V © W) with the adjoint representation ad: gl(V 6 W) > gl(gl(V œ W)). It 
is then easy to see that the equation (4.1) describes the resulting action of g on Hom, (V, W). 

An important special case of this is where W = k is the trivial representation (as above, so that the 
map g: g — gl(k) is the zero map). This allows us to give V* = Hom(V,k), the dual space of V, a 
natural structure of g-representation where (since ø = 0) the action of x € gon f € V* is given by 
p*: g > gl(V*) where 

ENF) =—fo(e(@)) (fe Vv"). 
Ifa: V > V is any linear map, recall that the transpose map a’: V* — V* is defined by a‘(f) = f o qa, 
thus our definition of the action of x € g on V* isjust® —p(x)*. This makes it clear that the action of g on 
V* is compatible with the standard constructions on dual spaces, e.g. if U is a subrepresentation of V, 
the U” the annihilator of U will be a subrepresentation of V*, and moreover, the natural isomorphism 
of V with V** is an isomorphism of g-representations. 

We end this section with some terminology which will be useful later. 


Definition 4.3. A representation is said to be irreducible if it has no proper non-zero subrepresentations, 
and it is said to be completely reducible if it is isomorphic to a direct sum of irreducible representations. 


Example 4.4. Giving a representation of gl, is equivalent to giving a vector space equipped with a 
linear map. Indeed as a vector space gl, = k, hence if (V, p) is a representation of gl, we obtain a 
linear endomorphism of V by taking p(1). Since every other element of gl, is a scalar multiple of 1 this 
completely determines the representation, and this correspondence is clearly reversible. 

If we assume k is algebraically closed, then you know the classification of linear endomorphisms is 
given by the Jordan canonical form. From this you can see that the only irreducible representations 
of gl, are the one-dimensional ones, while indecomposable representations correspond to linear maps 
with a single Jordan block. 


5. NILPOTENT LIE ALGEBRAS 


We now begin to study particular classes of Lie algebra. The first class we study are nilpotent Lie 
algebras, which are somewhat analogous to nilpotent groups. We need a few more definitions. 


Definition 5.1. If V,W are subspaces of a Lie algebra g, then write [V, W] for the linear span of the 


elements {[v, w] : v € V,w € W}. Notice that if J, J are ideals in g then so is |Z, J]. Indeed to check this, 
note thatifi € J,7 € J,x € g we have: 


using the Jacobi identity in the first equality and skew-symmetry in the second. 
8 ty quality y- ry. 


Definition 5.2. For g a Lie algebra, let C°(g) = g, and C’ (g) = [g, C’~'(g)] for i > 1. This sequence of 
ideals of g is called the lower central series of g, and we say g is nilpotent if C™ (g) = 0 for some N > 0. If 
N is the smallest integer such that C™ (g) = 0 then we say that g is an N-step nilpotent Lie algebra. 


For example, a Lie algebra is 1-step nilpotent if and only if it is abelian. The definition can be 
rephrased as follows: there is an N > 0 such that for any N elements 21, %2,...,«N of g the iterated Lie 
bracket 

(v1, [x2,[-.-,[@n—1,2N]]---] = ada, (adro(...adxy_i(ay))...) =0. 
In particular, all the elements ad(a) € gl(g) for x € g are nilpotent. 


Remark 5.3. The ideal C1(g) = [g, g] is known as the derived subalgebra? of g and is also denoted!” D(g) 
and sometimes g’. 


7As I did in the previous version of the notes. 
8Note that the minus sign is crucial to ensure this is a Lie algebra homomorphism — concretely this amounts to noticing that 
8 8 P y 8 

A++ —A! preserves the commutator bracket on n x n matrices. 

°Oddly, not as the derived ideal even though it is an ideal. 

10Partly just to cause confusion, but also because it comes up a lot, playing slightly different roles, which leads to the different 

yJ P Playing sightly 
notation. We'll see it again shortly in a slightly different guise. 
8 y gnuy 8 
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Lemma 5.4. Let g be a Lie algebra. Then 


(1) If g is nilpotent, so is any subalgebra or quotient of g. 
(2) If g/3(g) is nilpotent, then g is nilpotent. 


Proof. The first part is immediate from the definition. Indeed if h C g is a subalgebra of g then clearly 
we have C"(h) C C*(g), so that if C™ (g) = 0 we also have C™ (b) = 0. Similarly, an easy induction say 
shows that if b is an ideal, then C*(g/h) = (C’(g) + b)/b, and so again if C™ (g) = 0 we must also have 
CN (g/b) = 0. 

For the second claim, taking h = 3(g), an ideal of g, we see from the first part that if g/3(g) is nilpotent 
so that there is some N with (C% (g) +3(g))/3(g) = C™ (g/3(g)) = 0, and so C™ (g) C 3(g). But then it is 
clear that C+! (g) = 0, and so g is nilpotent as required. 


Remark 5.5. Notice that if I is an arbitrary ideal in g, and J and g/J are nilpotent it does not follow that 
g is nilpotent. Indeed recall that if g is the non-abelian 2-dimensional Lie algebra, then g can be given a 
basis x, y with |x, y] = y. Hence k.y is a 1-dimensional ideal in g (which is thus abelian and so nilpotent) 
and the quotient is again 1-dimensional and so nilpotent. However, ad(x) has y as an eigenvector with 
eigenvalue 1, so g cannot be nilpotent, and indeed C’ (g) = k.y for all i > 1. 


Example 5.6. Let V be a vector space, and 
FH=(0=FoCHMChc...CcCk,=V) 


a sequence of subspaces with dim(F;) = i (such a sequence is known as a flag or complete flag in V). Let 
n = nç be the subalgebra of gl(V) consisting of linear maps X € gl(V) such that X(F;) C Fj_, for all 
i > 1. We claim the Lie algebra n is nilpotent. To see this we show something more precise. Indeed for 
each positive integer k, consider the subspace 


në = {x € gl(V) : z(F;) C Fj} 


(where we let 0 = F; for all | < 0). Then clearly n* C n, and n* = 0 for any k > n. We claim that 
C*(n) C n**1, which will therefore prove n is nilpotent. The claim is immediate for k = 0, so suppose 
we know by induction that C*(n) C n*t!. Then if x € n and y € n*t!, we have xy(F;) C a(F;-k-1) C 
F;i—k—2, and similarly yx(F;) C F;-p—2, thus certainly [x,y] € n+? and so C*t!(n) C n**? as required. 

In fact you can check that C*(n) = n**1, so that n is (n — 1)-step nilpotent i.e. C"~?(n) ¢ 0, and 
C”~!(n) = 0 (note that if dim(V) = 1 then n = 0). If we pick a basis {e1, e2, ..., en} of V such that 
F; = span(e1, e€2,...,e;} then the matrix A representing an element x € n with respect to this basis is 
strictly upper triangular, that is, a;; = 0 for alli > j. It follows that dim(n) = E) so when n = 2 we 
just get the 1-dimensional Lie algebra, thus the first nontrivial case is when n = 3 and in that case we 
get a 3-dimensional 2-step nilpotent Lie algebra. 


Note that the subalgebra t C gl,, of diagonal matrices is nilpotent, since it is abelian, so a nilpotent 
linear" Lie algebra need not consist of nilpotent endomorphisms. Nevertheless we will show there is 
a close connection between the two notions. 


Lemma 5.7. If x € gl(V) be a nilpotent endomorphism then ad(x) € gl(gl(V)) is also nilpotent. 


Proof. The map àz: gl(V) > gl(V) given by y +> zy is clearly nilpotent if x is nilpotent, and similarly 
for the map pz: gl(V) > gl(V) given by y +> yx. Moreover, Àz and pz clearly commute with each other, 
so since ad(x) = Az — Px, itis also nilpotent. Indeed for m > 0 we have 


(As = pa)” = Yn (ee 


and so if z” = 0, so that A? = p} = 0, then if m > 2n, every term on the right-hand side must be zero, 
and so ad(x)™ = 0 as required. 


For the next proposition we need the notion of the normaliser of a subalgebra. 
Definition 5.8. Let g be a Lie algebra and let a be a subalgebra. The subspace 
N,(a) = {x € g: [x,a] € a, Va € a} 
is a subalgebra of g (check this using the Jacobi identity) which is called the normaliser of a. It is the 


largest subalgebra of g in which a is an ideal. 


MWe say a Lie algebra is linear if it is realized as a subalgebra of g!(V) for some finite-dimensional vector space V. 
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Proposition 5.9. Let n be a Lie algebra, and (V, p) a representation of n such that for every x € n, the linear 
map p(x) is nilpotent. Then the subspace 


V” = {v eV: p(x)(v) =0, Vr E€ n}. 
is non-zero. 


Proof. We use induction on d = dim(n), the case d = 1 being clear. Clearly the statement of the propos- 
ition is unchanged if we replace n by its image in gl(V), so we may assume that n is a subalgebra of 
gl(V). Now consider a © n a proper subalgebra”. Since the elements a € a are nilpotent endomorph- 
isms, the previous lemma shows that ad(a) € gl(n) is also nilpotent, hence ad(a) also acts nilpotently 
on the quotient” n/a. Since dim(a) < dim(n), by induction (applied with V = n/a and a) we can find 
x ¢ a such that ad(a)(x) = [a, x] € a for all a € a. It follows that for any subalgebra a its normaliser 


Nala) = {x En: [x,a] € a,Va € a} 


strictly contains a. Thus if a is a proper subalgebra of n of maximal dimension, we must have Na (a) = n, 
or in other words, a must actually be an ideal of n. 

Now if n/a is not one-dimensional, the preimage of a one-dimensional subalgebra in n/a would be 
a proper subalgebra of n strictly containing a, which again contradicts the maximality of a. Thus n/a is 
one-dimensional, and we may find z € nso that k.z ® a =n. 

By induction, we know that V* = {v € V : a(v) = 0,Va € a} is a nonzero subspace of V. We claim 
that z preserves V*. Indeed 


a(z(v)) = [a, z|(v) + z(a(v)) =0, Va€a, vev", 


since [a, z] € a. But the restriction of z to V° is nilpotent, so the subspace U = {v € V": z(v) = 0} is 
nonzero. Since U = V" we are done. 


Remark 5.10. The notation V" used above will be used later: if g is any Lie algebra and (V, p) is a 
representation of g, then we write 


V? = {v E V : p(x)(v) = 0,Va E€ g}. 
It is called the (possibly zero in general) subrepresentation of invariants in V. 


Definition 5.11. If g is a Lie algebra and x € g, we say that an element z is ad-nilpotent if ad(x) € gl(g) 
is a nilpotent endomorphism. For convenience" we say that g is ad-nilpotent if all of its elements are. 


Theorem 5.12. (Engel’s theorem) A Lie algebra g is nilpotent if and only if ad(x) is nilpotent for every x € g. 


Proof. In the terminology of the above definition, the theorem states that a Lie algebra is nilpotent if 
and only if it is ad-nilpotent. As we already noted, it is immediate from the definition that a nilpotent 
Lie algebra is ad-nilpotent, so our task is to show the converse. For this, note that it is clear that if g is 
ad-nilpotent, then any quotient of it is also ad-nilpotent. Hence using induction on dimension along 
with Lemma 5.4 (2), it will be enough to show that 3(g) is a non-zero. Now 


3(9) = {2 € g : [2,2] =0, Var € g} 
= {z € g : —ad(x)(z) = 0, Yx € g} 


= gio, 


hence applying Proposition 5.9 to the adjoint representation of g, it follows immediately that 3(g) 4 0 
and we are done. 


The following is an important consequence of the Proposition 5.9. 


Corollary 5.13. Let g be a Lie algebra and (V, p) a representation of g such that p(x) is a nilpotent endomorphism 
for all x € g. Then there is a complete flag F = (0 = Fo C Fi C ... C Fa = V) such that p(g) C ne. 


12ih lecture I insisted that a was non-zero, but actually we don’t need this. 

13The notation here can be a bit confusing: the Lie algebra n is an a-representation by the restriction of the adjoint represent- 
ation of g to a. Since a is a sub-a-representation of g, the quotient g/a is an a-representation. Note however that it is not itself a 
Lie algebra unless a is an ideal of g. 

“Though because of the following theorem this terminology won’t be used much. 
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Proof. Let us say that g respects a flag F if p(g) C nz. Use induction on dim(V). By the Proposition 
5.9, we see that the space V! # 0. Thus by induction we may find a flag F’ in V/V" which n respects. 
Taking its preimage and extending arbitrarily (by picking any complete flag in V!) we get a complete 
flag in V, we obtain a flag which g clearly respects as required. 


6. SOLVABLE LIE ALGEBRAS 


We now consider another class of Lie algebras which is slightly larger than the class of nilpotent 
algebras. For a Lie algebra g, let D°g = g, and D’*'g = [D’g, D’g]. D’g is the i-th derived ideal of g. 
Note that C'(g) = D'g is Dg the derived subalgebra of g. 


Definition 6.1. A Lie algebra g is said to be solvable if D g = 0 for some N > 0. 


Since it is clear from the definition that D’g C C*(g), any nilpotent Lie algebra is solvable, but as 
one can see by considering the non-abelian 2-dimensional Lie algebra, there are solvable Lie algebras 
which are not nilpotent. 


Example 6.2. Let V be a vector space and F = (0 = Fy < Fi <... < Fa = V) a complete flag in V. Let 
br = {re gl(V) : 2(Fi) C Fi}, 


that is, bz is the subspace of endomorphisms which preserve the complete flag F. We claim that b+ 
is solvable. Since any nilpotent Lie algebra is solvable, and clearly g is solvable if and only if Dtg is, 
the solvability of g will follow if we can show that D'g = nz. To see this, suppose that x,y € bz and 
consider |x, y]. We need to show that |z, y|(Fi) C Fi-1 for each i, 1 <i < n. Since x,y € bp, certainly 
we have |x, y](F;) C F; for alli, 1 < i < n, thus it is enough to show that the map |z, y] induced by [z, y] 
on F;/F,;_1 is zero. But this map is the commutator of the maps induced by x and y in End(F;/F;_1), 
which since F; /F;_1 is one-dimensional, is abelian, so that all commutators are zero. 

If we pick a basis {e1, €2,...,€n} of V such that F; = span(ei,...,e;), then gl(V) gets identified with 
gl,, and bz corresponds to the subalgebra 6, of upper triangular matrices. It is straight-forward to 
show by considering the subalgebra t,, of diagonal matrices that b, is not nilpotent. 


We will see shortly that, in characteristic zero, any solvable linear Lie algebra g C gI(V) is a subal- 
gebra of bz for some complete flag F. We next note some basic properties of solvable Lie algebras. 


Lemma 6.3. Let g be a Lie algebra, 6: g — b a homomorphism of Lie algebras. 
(1) If ¢ is surjective, then (D*g) = D*(¢(g)). In particular $(g)) is solvable if g is, thus any quotient of 
a solvable Lie algebra is solvable. 
(2) If g is solvable then so are all subalgebras of g. 
(3) Ifim(@) and ker(@) are solvable then so is g. Thus if I is an ideal and I and g/I are solvable, so is g. 


Proof. The first two statements are immediate from the definitions. For the third, note that if im(@) is 
solvable, then for some N we have Dim(¢) = {0}, so that by part (1) we have D” (g) C ker(¢), hence 
if D“ker(¢) = {0} we must have DN+™g = {0} as required. 


Note that, as mentioned above, the part 3) of the above Lemma is false for nilpotent Lie algebras. 
6.1. Lie’s theorem. In this section we will assume that our field k is algebraically closed of characteristic zero. 


Lemma 6.4. (Lie’s Lemma) Let g be a Lie algebra and let I C g be an ideal, and V a finite dimensional repres- 
entation. Suppose v € V is a vector such that x(v) = A(x).v for all x € I, where A: I — k. Then A vanishes on 
[9,7] CL. 


Proof. Let x € g. For each m € N let Wm = span(v, x(v),...,27(v)}. Thus the Wm form a nested 
sequence of subspaces of V. We claim that hz™(v) € A(h)z™v + Wm-ı for all h € I and m > 0. Using 
induction on m, the claim being immediate for m = 0, note that 


ha™(v) = [h, s]! (v) + zh”! (v) 
E (A((h, z])2 tu + Wm-2) + 2(A(h)2™—*(v) + Wm-2) 
E A(h)a™(v) + Wm-1, 
where in the second equality we use induction on m for both h, [h, z] € T. 
Now since V is finite dimensional, there is a maximal n such that the vectors {v,z(v),...,2"(v)} 
are linearly independent, and so Wm = Wn for all m > n. It then follows from the claim that W, is 


preserved by x and every h € I. Moreover, the claim shows that for any h € I the matrix of |x, h] 
with respect to the basis {v, z(v)...,2"(v)} of Wn is upper triangular with diagonal entries all equal to 
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(a, h]). It follows that tr({x, h]) = (n + 1)X([x, h]). Since the trace of a commutator is zero”, it follows 
that (n + 1)A({x, h]) = 0, and so since char(k) = 0 we conclude that A([x, h]) = 0. 


Theorem 6.5. (Lie’s theorem) Let g be a solvable Lie algebra and V is a g-representation. Then there is a 
homomorphism à: g > gl, (k) and a nonzero vector v € V such that x(v) = A(x).v forall x € g. 


Proof. We use induction on dim(g). If dim(g) = 1, then g = k.a for any nonzero x € g, and since k is 
algebraically closed, x has an eigenvector in V and we are done. For dim(g) > 1, consider the derived 
subalgebra D'g. Since gis solvable, D'g is a proper ideal of g. The quotient g/ Dtg is abelian, and taking 
the preimage of any codimension one subspace of it gives a codimension 1 ideal I of g. By induction 
we may pick a homomorphism à: I — k such that the subspace U = {w € V : x(w) = A(h).w, VA € I} 
is nonzero. Now if x € g, then 
h(a(w)) = [h, z] (w) + ch(w) 
= X([h, 2])(w) + Alh) x(w) 
= A(h).2(w). 


where in the second equality we used Lie’s Lemma. Thus g preserves U. Now since J is codimension 
one in g, we may write g = kx @ I for some x € g. Taking an eigenvector v € U of x completes the 
proof. 


The analogue of Corollary 5.13 for solvable Lie algebras is the following: 


Corollary 6.6. Let b C gl(V) be a solvable subalgebra of gl(V). Then there is a complete flag F of V such that 
b C be. 


Proof. By induction on dim(V), where Lie’s theorem provides the induction step. 


Note that this theorem shows that if g is a solvable Lie algebra, then any irreducible representation of 
g is one-dimensional. Since the converse is clearly true, we can rephrase Lie’s theorem as the statement 
that any irreducible representationa of a solvable Lie algebra is one-dimensional. As such it is natural 
to note the following simple Lemma. 
Lemma 6.7. Let g be a Lie algebra. The one-dimensional representations of g are parametrized by the vector 
space (g/Dg)* as follows: if (V, p) is a a representation of g and dim(V) = 1, then p: g —> End(V) = k, and 
p(Dg) = 0, so that p induces a linear functional a: g/ Dg —> k. 
Proof. If p: g + gl(V), then since dim(V) = 1, the associative algebra End(V) is commutative, so that 
gl(V) is abelian. Thus 

pliz, y] = p@)e(y) — puel) =0, (Va,y € 9), 

hence p(Dg) = 0, and p induces a € (g/Dg)* as claimed. Conversely, given a: (g/Dg)*, the map 
p: g > gh (k) given by x +> a(x + Dg) is a Lie algebra homomorphism, because 


plz, y]) = 0 = a(x + Dg)aty + Dg) — aly + Dg)a(x + Dg). 


An isomorphism classes of one-dimensional representations of g are thus given by the space of linear 
functionals (g/Dg)*. We will refer to an element a € (g/Dg)* as a weight of g. They should be thought 
of as the generalization of the notion of an eigenvalue of a linear map. Lie’s theorem shows that if g 
is a solvable Lie algebra (hence in particular if g is nilpotent), then any irreducible representation is 
one-dimensional, so that (g/Dg)* parametrizes the irreducible representations of g. For a € (g/Dg)* 
let us write ka for the representation (k, œ) of g on the field k given by a. 


7. REPRESENTATIONS OF NILPOTENT LIE ALGEBRAS 


In this section we assume that k is an algebraically closed field of characteristic zero. 

A representation (p, V) of the one-dimensional Lie algebra gl (k) on a k-vector space V is given, 
once we chose a basis vector e of gl, (k), by a single linear map ¢: V — V via the correspondence 
o = p(e). Thus the classification of representation of gl} (k) is equivalent to the classification of linear 
endomorphisms’® This classification is of course given by the Jordan normal form (at least over an al- 
gebraically closed field). In this section we will see that a (slightly weaker) version of this classification 
holds for representations of any nilpotent Lie algebra. 


15t is important here that p([x, h]) is the commutator of p(x) and p(h) both of which preserve Wn — by the claim in the case of 
p(h), and by our choice of n in the case of p(x) — in order to conclude the trace is zero. 
16Essentially the same is true for representations of the abelian group Z. 
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7.1. Generalised eigenspaces. We begin by reviewing some linear algebra. Let x: V — V be a linear 
map. Let V) be the generalized eigenspaces for x with eigenvalue A: 


Vy = {vE V : IN > 0,(x — àA)” (w) = 0}. 


(thus V) is zero unless A is an eigenvalue of x). 


Lemma 7.1. Let x: V — V be a linear map. There is a canonical direct sum decomposition 


v=@®n, 


AEk 


of V into the generalized eigenspaces of x. Moreover, for each A, the projection to ax: V — Vy (with kernel the 
remaining generalized eigenspace of x) can be written as a polynomial in x. 


Proof. Let mz € k[t] be the minimal polynomial of x. Then if ¢: k[t] + End(V) given by t +> x denotes 
the natural map, we have k[t]/(mz) = im(@¢) C End(V). If mz = pert — ài)” where the å; are the 
distinct eigenvalues of x, then the Chinese Remainder Theorem shows that 


k 
k[t]/(me) = p k[e]/(t— à)", 


It follows that we may write 1 € k[#]/(m,) as 1 = e1 +... + ex according to the above decomposition. 
Now clearly e;e; = 0 if i # j and e? = e;, so that if U; = im(e;), then we have V = @Q,<,<, Ui. 
Moreover, each e; can be written as polynomials in x by picking any representative in k/t] of e; (thought 
of as an element of k[t]/(m,)). Note in particular this means that each U; is invariant under im(¢). 

It thus remains to check that U; = Vy,. Since (t — A;)"‘e; = 0 € k[t]/(mz), it is clear that U; C Vy,. To 
see the reverse inclusion, suppose that v € Vj, so that, say, (x—A;)” (v) = 0. Write v = vi+...+vx, where 
vj E€ Uj. Now since (x — à;)” (v) = 0, and each U; is stable under im(¢) it follows that (x — A;)"(v;) = 0 
for each j, (1 < j < k). If 1 < l< kand l # k, then since (t — à;)” and (t — ,)™ are coprime, we may 
find a,b € k[t] such that a.(t — Aj)" + b.(t — à )™ = 1. 

Setting t = x in this equation, and applying the result to the vector v; we find: 


v = l.v = (a.(t = ri)” + b(t = X1)”) (v1) 
(7.1) = (t = Aa)” (vi) + (t = Ar)™ (vi) 
0, 


where the first term is zero by the above, and the second term is zero since v, € Uj. It follows that 
v =v; E€ U; and so Vy, C U; as required. 


Next it is convenient to introduce some notation. 

Definition 7.2. A sequence of maps of g-representations 
psy at oy 

is said to be exact at V if im(a) = ker({). A sequence of maps 


0 y Soy ka Wy 0 


is called a short exact sequence if it is exact at each of U, V and W, so that a is injective and £ is surjective 
and im(a) = ker(8). If V is the middle term of such a short exact sequence, it contains a subrepresent- 
ation isomorphic to U, such that the corresponding quotient representation is isomorphic to W, and 
hence, roughly speaking, V is built by gluing together U and W in some fashion. 


Lemma 7.3. i) Let V be a vector space and write A = Hom,(V,V) for the associative algebra of linear 
maps from V to itself. Then we have: 
n n f E 
(= A= wy = (7) ee -VOe Yoyes, 
i=0 
where as usual ad(x)(y) = |x, y] = zy — yz. 
ii) Let g be a Lie algebra and (V, p) a representation of g. Then for all x,y € g we have 
n 7 z 
7.2) (o(2) -à= "o =F È; Jade) -0a = wy. 


i=0 
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Proof. Note first that in any associative algebra A, if a,b € A commute, i.e. ab = ba, then the binomial 
theorem shows that (a + b)” = S07.) ()a’b"™. 

Letr,: A — Abe the map given by r,(y) = yz, right-multiplication by x, and similarly let l: A— A 
be the map lz(y) = xy be the map given by left-multiplication by x. The ad(x) = ls — rz. Then clearly 


le, Ta, A-idy, idy all commute, and we have: 
(z — à- p)"y = (le — à — u)” (y) 
= (ad(x) + rs — à — u) (y) 


where we apply the binomial theorem to the (commuting) pair (ad(x) — À, rs — u) to the fourth line. 
For the second part, simply apply the first part to p(x), p(y), using the fact that ad(p(x))(e(y)) = 
p(ad(x)(y)). 


Theorem 7.4. Let h be a nilpotent Lie algebra and (V, p) a finite dimensional representation of h. For each 
à € (g/Dg)*, set 


VY ={v eV: forall x € ġ,3n > 0 such that (p(x) — A(x))"(v) = 0}. 
Then each Vy is a subrepresentation and 
v= 
A 


that is, V is the direct sum of subrepresentations indexed by the one-dimensional representations occuring in V. 


Proof. We use induction on dim(V). If dim(V) = 1 the result is trivial. Next note that if V = U ẹ W is a 
direct sum of proper subrepresentations, then applying induction we know the result holds for U and 
W respectively, and clearly Vy = U, @ Wy, so that we may conclude the result holds for V. 

Now suppose that x € h and let V = @ Vy (x) denote the generalised eigenspace decomposition of 
V for p(x). We claim each subspace V; (x) is a subrepresentation of h. Indeed since h is a nilpotent Lie 
algebra, the map ad() is nilpotent, hence if k > dim(h) then ad(x)*(y) = 0 for all y € b. It follows that 
if v € V\(x), so that (p(x) — A)*(v) = 0 for all k > N say, then if k > N + dim(b), we see from equation 
(7.2) (with u = 0) that (p(x) — A)*(p(y)(v)) = 0, since we must have either i > dim(b) or k — i > N if 
k > N +dim(h), and so p(y) (v) € Væ) (x) as required. 

It follows that if there is some x € h which has more than one eigenvalue, then the decomposition 
V = @ Vy) (x) shows that V is a direct sum of proper subrepresentations, and we are done by induc- 
tion. Thus we are reduced to the case where for all x € h the linear map p(x) has a single eigenvalue 
A(x) say. But then z +> A(x) must be an element of (h/Dbh)* by Lie’s theorem: Pick a one-dimensional 
subrepresentation L of V, then p(x) acts by a scalar u(x) on L, where u € (h/Dh)* and since A(x) is the 
only eigenvalue of p(x) by assumption we must have A(x) = p(x). Now clearly V = V) and we are 
done. 


Remark 7.5. The (nonzero) subrepresentations Vy are known as the weight spaces of V, and A € (h/Db)* 
for which Vy # 0 are known as the weights of V. 


8. CARTAN SUBALGEBRAS 


In this section we work over an algebraically closed field k. 
Let g be a Lie algebra. Recall that if h is a subalgebra of g then the normalizer Ng(h) of g is 


N,(b) = {x € g : [x, h] € h, VA € b}. 


It follows immediately from the Jacobi identity that N,(b) is a subalgebra, and clearly N,(b) is the 
largest subalgebra of g in which h is an ideal. 


Definition 8.1. We say that a subalgebra h is a Cartan subalgebra if it is nilpotent and self-normalizing, 
that is, N,(h) = b. 
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It is not clear from this definition if a Lie algebra necessarily has a Cartan subalgebra. To show this 
we need a few more definitions. 
Definition 8.2. If x € g, let go,» be the generalized 0-eigenspace of ad (x), that is 
goz = {y € g : IN > Osuch that ad(x)™ (y) = 0} 


We say that x € g is regular if gox is of minimal dimension. Note that it follows from the formula: 


(8.1) ad(a)"lv2] = D> (7) edo" oade) 


k=0 
that go, is a subalgebra of g. (The formula can be established, for example, by an easy induction.) 


Lemma 8.3. Let x € g be a regular element. Then go, is a Cartan subalgebra of g. 


Proof. We first show that b is a self-normalizing in g. Indeed if z € Ng(b) then [2, z] € h (since certainly 
x € b), so that for some n we have ad(x)”([x,z]) = 0, and hence ad(x)"*!(z) = 0 and z € h as 
required”. 

To show that bh is nilpotent, we use Engel’s theorem, that is we will show that for each y € h the 
map ad(y) is nilpotent as an endomorphism of h. To show ad(y) is nilpotent on h, we consider the 
characteristic polynomial of ad(y) on g and h. Since h is a subalgebra of g, the characteristic polynomial 
xX” (t) € k[t] of ad(y) on g is the product of the characteristic polynomials of ad (y) on h and g/b, which 
we will write as x7 (t) and x% (t) respectively. 

We may write x” (t) = 079 ck (y)t" (where n = dim(g)). Pick {h1, ho,...,h,} a basis of b, so that 
we may write y = `; Zihi. Then we may view the coefficients c as polynomials in the coordinates 
{aj :1 < i < r}. Similarly, we have y¥ = X ;—o de(y)t*and x3 = S075 ex(y)t’ where the d;,ej are 
polynomials in the {x; : 1 < i < r}. Now we know that if y = x then ad (x) is invertible on g/b, since it 
has no 0-eigenspace there, so that x3 has e9(x) # 0, and thus the polynomial eo is nonzero. 

Now for each y € h the number min{i : c;(y) # 0} is clearly dim(go,,). However, suppose we write 
x(t) = t° Xo dk+st”, where d, # 0 is a nonzero polynomial in the {z; : 1 < i < r}. Then we would 
have 

X” (t) =O (ds tdszit+...)(eo tertt+...)=tdseg+..., 
hence if we pick z € go, such that d,(z)eo(z) is nonzero, then go,, has dimension s, so by the assump- 
tion that x is regular, we must have s > r. On the other hand s < r since xı has degree r, thus we 
conclude that s = r and y{(t) = t" for all y € go... Hence every ad(y) is nilpotent on go,x, so that go,» is 
a Cartan subalgebra as required. 


In the course of the proof of the above Proposition we used the following fact about the coefficients 
of the characteristic polynomial. It was crucial because, whereas the product of two arbitrary nonzero 
functions may well be zero, the product of two nonzero polynomials (over a field) is never zero. For 
completeness we give a proof’. (To apply it to the above, take V = g, A = h and y = ad). 


Lemma 8.4. Suppose that V and A are finite dimensional vector spaces, p: A — End(V) is a linear map, and 
{a1,42,..., ap} is a basis of A. Let 
d 
Xa(t) = X` c(a) € kit] 
i=0 
be the characteristic polynomial of (a) € A. Then if we write a = Ea zia, the coefficients ci(a) (1 < i < d) 
are polynomials in k|x1, £2,..., £p]. 


Proof. Pick a basis of V so that we may identify End (V ) with Mat, (k) the space of n x n matrices. Then 
each y(a;) is a matrix (a* 


: k 
1“)i<jk<n and if a = J i; vie; we have 


k 


Xa(t) = det(tIn — X zip(a;)), 


i=1 
which from the formula for the determinant clearly expands to give a polynomial in the x; and t, which 
yields the result. 


17Note that this didn’t use the fact that x is regular, hence go,x gives a self-normalizing subalgebra of g for any x € g. 
181¢ this all seems overly pedantic then feel free to ignore it. 
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Remark 8.5. As an aside, there’s no reason one needs to pick a basis of a vector space V in order to talk 
about the space k[V] of k-valued polynomial functions on it. For example, one can define k[V] to be the 
subalgebra of all k-valued functions on V which is generated by V* the space of functionals on V. (This 
is fine if k is algebraically closed at least, if that is not the case then one should be a bit more careful, 
e.g. recall if k is finite, then an element of k[t] is not a function on k). 


9. THE CARTAN DECOMPOSITION 


In this lecture we work over an algebraically closed field k of characteristic zero. 

Our study of the representation theory of nilpotent Lie algebras can now be used to study the struc- 
ture of an arbitrary Lie algebra. Indeed, if g is any Lie algebra, we have shown that it contains a Cartan 
subalgebra 6, and the restriction of the adjoint action makes g into an h-representation. As such it 
decomposes into a direct sum 

g= Pp ga. 


AE (b/Db)* 
The next Lemma establishes some basic properties of this decomposition. 


Lemma 9.1. Let g, b be as above. Then h = go. Moreover, if A, u are one-dimensional representations, then 


[ga gu] E Jatu- 


Proof. Clearly h C go, since h is nilpotent. Consider go/h as an b-representation. By Lie’s theorem it 
contains a one-dimensional submodule L, and since h acts nilpotently on go be definition, h acts as zero 
on L. But then the preimage of L in go normalizes h which contradicts the assumption that h is a Cartan 
subalgebra. 
Applying equation (7.2) with p = ad we see that: 
(ad(x) — (A(x) + u())1)"[y, 2] = $C (") [(ad(x) — A(ax).1)*(y), (ad(x) — w(x).1)"™(z)]. 


i=0 


The containment |g), gu] E ga+, then follows using the same argument as in the first paragraph of 
Theorem 7.4. 


Definition 9.2. By the previous Lemma, if h is a Cartan subalgebra of g then g decomposes into a direct 


sum 
g=bh9 aD ga- 
A#0 

This is known as the Cartan decomposition of g. The set ® of non-zero À € (g/Dg)* for which the 
subspace g) is non-zero is called the set of roots of g, and the subspaces g are known” as the root spaces 
of g. 

Although we will not prove it in this course, any two Cartan subalgebras of g are conjugate by an 
automorphism” of g, that is, given any two Cartan subalgebras hy, h2 there is an isomorphism a: g > g 
such that a(b1) = he, so that the decomposition of g is unique up to automorphisms. 


10. TRACE FORMS AND THE KILLING FORM 


In this section we introduce certain symmetric bilinear forms, which will play an important role in 
the rest of the course. See the Section 22 for a brief review of the basic theory of symmetric bilinear 
forms”. 


Definition 10.1. If g is a Lie algebra, and let (V, p) be a representation of g. we may define a bilinear 
form ty: g xX g > k on g, known as a trace form by setting 


tv(x,y) = trv (p(x) p(y). 


19; ¢. in the terminology for representations of nilpotent Lie algebras discussed above, the roots of g are the weights of g as 


an b-representation. 

20In fact, they are even conjugate by what is known as an inner automorphism. 

2lPart A Algebra focused more on positive definite and Hermitian forms, but there is a perfectly good theory of general 
symmetric bilinear forms. 
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Clearly, ty is symmetric and bilinear, because tr(ab) = tr(ba), for any endomorphisms a, b of V, but 
it also satisfies another important property: it is invariant, that is: 


tv ([z, yl, z) = tv (x, [y, 2]). 
To see this, note that 


ty ([x, y], z) = tr(o([z, y])e(2)) 
= tr(p(x)p(y)(z)) — tr(o(y) p(x) p(z)) 
= tr(p(x)p(y)(z)) — tr(e(x) e(z)p(y)) 
= tr(p(x), p(y, z])) 
= ty (x, [y, z]). 


where in the second line we again used the fact that tr(ab) = tr (ba). 


Definition 10.2. The Killing form K: g x g > kis the trace form given by the adjoint representation, that 
is: 
r(x, y) = tr(ad(x)ad(y)). 
Note that if a C g is a subalgebra, the Killing form of a is not necessarily equal to the restriction of 
that of g. We will write x! when it is not clear from context which Lie algebra is concerned. 


If a is an ideal in g, then in fact the Killing form is unambiguous, as the following Lemma shows. 


Lemma 10.3. Let a be an ideal of g. The Killing form K* of a is given by the restriction of the Killing form K! 
on g, that is: 


Proof. If a € a we have ad(a)(g) C a, thus the same will be true for the composition ad(a;)ad(a2) for 
any a1, @2 € a. Thus if we pick a vector space complement W to a in g, the matrix of ad (aı)ad (a2) with 
respect to a basis compatible with the subspaces a and W will be of the form 


A B 

0 0. 
where A € End(a) and B € Hom,(a,W). Then clearly tr(ad(ai)ad(a2)) = tr(A). Since A is clearly 
given by ad(a1)jaad(a2)ja, we are done. 


The Killing form also allows us to produce ideals: If a denotes a subspace of g, then we will write 
a+ for the subspace 
{x E€ g: (x,y) = 0,Vy € a}. 


Lemma 10.4. Let g be Lie algebra and let a be an ideal of g. Then at is also an ideal of g. 

Proof. Suppose that x € g and z € a+. We need to show that [z, z] € at. But if y € a we have 
K([2x, 2], y) = —K([z, x], y) = —K(z, [x, y]) = 0, 

since [x,y] € a since ais an ideal. Hence |x, z] € a+ as required. 


11. CARTAN CRITERIA FOR SOLVABLE LIE ALGEBRAS 


In this section k is an algebraically closed field of characteristic zero. 
We now wish to show how the Killing form yields a criterion for determining whether a Lie algebra 
is solvable or not. For this we need a couple of technical preliminaries. 


Lemma 11.1. Let g be a Lie algebra and let h be a Cartan subalgebra, with associated Cartan decomposition 
g = b S Oreg gr. Let (V, p) be a representation of g and let V = @ cw Vy be the generalised weight-space 
decomposition of V as an b-representation. Let A € Vand a € ®. Then there isan r € Q such that the 
restriction of À to [ga, G—a] is equal to ra. 


Proof. The set of weights W is finite, thus there are positive integers p,q such that A + ta € W for all 
integers t with —p < t < q but À — (p + 1)a ¢ Y and à + (q + 1)a ¢ Y. Let M = @_p<t<q Vista. If 
z € [ga; 9-a] is of the form [x,y] where x € ga, Y € g-a then since 

p(£)(Va+qa) E Vatta = {0},  p(y)(Va—pa) E Vi-(p4tya = {0} 
we see that p(x) and p(y) preserve M. Thus the action of p(z) on M is the commutator of the action of 


p(x) and p(y) on M, and so tr(p(z), M) = 0. On the other hand, we may also compute the trace of p(z) 
on M directly: 
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0 = tr(p(z), M) 
= J tlo(z), Vara) 


—pstsq 
= X (A(z) + ta(z)) dim(Vasta). 

—PpStsq 
since any h € h acts on a generalised weight-space V,, with unique eigenvalue u(h). Rearranging the 
above equation gives \(z) = ra(z) for some r € Q as required (where the denominator is a sum of 
dimensions of subspaces which are not all zero, and hence is nonzero, and clearly r does not depend 
on 2). 


Proposition 11.2. Let g C gl(V) be a Lie algebra. Suppose that Dg = g. Then there is an x € g for which 
ty (a,x) £0. 

Proof. Suppose that h is a Cartan subalgebra of g, and that g = @ coy {0} 9a is the associated Cartan 
decomposition, where b = go. Similarly let V = © ey Vi be the decomposition of V into generalised 
h-weight spaces as in Theorem 7.4. Now if V = Vo, then every x € h is nilpotent, and hence ad(z) is 
also nilpotent, so that we must have g = go = b. But since Dg = g, so that g is not solvable, this is 
impossible, and hence there is some A € Y with à Æ 0. 


Next observe that 
9=Dg=(a.9=[ PB o, @ o] = dolor g] 


AE BuU{0} we PU{0} AM 


Since we know that [g,, gu] E 9.4, and moreover h = go, it follows that we must have 
b = [b,b] + > [ga 9- 


where the sum runs over those roots a such that —a € ®. But by definition, \ vanishes on Db, so that 
there must be some a € ® with X((ga, g_a]) 4 0. 
Picking x € [ga, g-a] such that A(x) ¥ 0, we find that: 


ty(a,x) = tr(a =) dim(V))A 
NEV 
But now by Lemma 11.1 we know that for any u € W there is an ra „ E Q such that u(x) = ra „-a(x) for 
all x € [ga, g—a]. In particular, 0 4 A(x) = re,,a(zx) so that ra, # 0 and a(x) # 0. Hence we see that 
ty (x, x) = (52 dim(V, Ja(a)?. 
HEY 


Since the terms in the sum are nonnegative, and the term corresponding to A is positive, we conclude 
ty (x, x) # 0 are required. 


Applying the previous Proposition to the Killing form we can give a criterion for a Lie algebra to be 
solvable. 


Theorem 11.3 (Cartan’s criterion for solvability). A Lie algebra g is solvable if and only if the Killing form 
restricted to Dg is identically zero. 


Proof. Since g is solvable if and only if g/3(g) is, we may assume that the adjoint representation embeds 
ginto gl(g). Consider the derived series D*g, (k > 1). If there is some k with D'g = D'+!g = D(D*g) 4 
{0}, then by Lemma 10.3 and Proposition 11.2 applied to g C gl(g), there is an z € D*g with 

K8(a,2) = KP'9 (a, x) #0, 
and hence « is not identically zero. Thus we conclude D*+!g is a proper subspace of D*g whenever 
Dg is nonzero, and hence since g is finite dimensional, it must be solvable as required. 

For the converse, if g is solvable, then by Lie’s theorem we can find a filtration 0 C Fy, C Fy C... C 
F,, = g of ad-stable subspaces with dim(F;) = i. Now if x,y € g the maps induced by ad(x)ad(y) 
and ad(y)ad(x) on F;/Fi—ı are equal, since gl, is commutative. Thus it follows that if z = [x,y] € Dg, 
then ad(z) = [ad(x),ad(y)] maps F; into F;_; and hence for all z € Dg we have ad(z)(F;) C Fi-1. 
But now if 21, z2 € Dg, then ad(zı)ad(z2)(F;) C Fi—2, and hence ad(z1)ad(z2) is nilpotent, and so 
K(21, 22) = tr(ad(z1)ad(z2)) = 0 as required. 
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12. SEMISIMPLE LIE ALGEBRAS, EXTENSIONS AND SEMI-DIRECT PRODUCTS. 


Suppose that g is a Lie algebra, and a and 6 are solvable Lie ideals of g. It is easy to see that a + b is 
again solvable (for example, because 0 C a C a+ b, and a and (a+ 6)/a = b/(aN b) are both solvable). 
It follows that if g is finite dimensional, then it has a largest solvable ideal r (in the strong sense: every 
solvable ideal of g is a subalgebra of r). 


Definition 12.1. Let g be a finite dimensional Lie algebra. The largest solvable ideal t of g is known as 
the radical of g, and will be denoted rad(g). We say that g is semisimple if rad(g) = 0, that is, if g contains 
no non-zero solvable ideals. 


Lemma 12.2. The Lie algebra g/rad(g) is semisimple, that is, it has zero radical. 


Proof. Suppose that s is a solvable ideal in g/rad(g). Then if s’ denotes the preimage of s in g, we see 
that s’ is an ideal of g, and moreover it is solvable since rad(g) and s = s’/rad(g) as both solvable. But 
then by definition we have s’ C rad(g) so that s’ = rad(g) and s = 0 as required. 


Thus we have shown that any Lie algebra g contains a canonical solvable ideal rad(g) such that 

g/rad(g) is a semisimple Lie algebra. We thus have an exact sequence” 

0 —> rad(g) —> g —> g/rad(g) —> 0, 
so that, in some sense at least, every finite dimensional Lie algebra is “built up” out of a semisimple 
Lie algebra and a solvable one. Slightly more precisely, if 

0 gı g 92 0 
is an exact sequence of Lie algebras, we say that g is an extension of gə by gı. Thus Lemma 12.2 can be 
rephrased as saying that any Lie algebra is an extension of the semisimple Lie algebra g/rad(g) by the 
solvable Lie algebra rad(g). 


Definition 12.3. Suppose that g, h are Lie algebras, and we have a homomorphism ¢: g —> Der,(6), 
the Lie algebra of derivations” on h. Then it is straight-forward to check that we can form a new Lie 
algebra h x g, the semi-direct product of g and h by ¢ which as a vector space is just g © h, and where the 
Lie bracket is given by: 


[(r1, Y1), (£2, Y2)] = ([v1, £2] + O(Y1)(@2) — (y2) (£1), [y1; Yel), 


where z1, £2 € b, y1, Y2 € g. This is the Lie algebra analogue of the semidirect product of groups, where 
you build a group H x G via a map from G to the automorphisms (rather than derivations) of H. The 
Lie algebra h, viewed as the subspace {(x,0) : x € b} of b x g, is clearly an ideal of h x g and it is easy 
to check that we then have an extension 


0 — h — b x g —> g — 0. 


Thus semi-direct products give us examples of extensions of Lie algebras. 
In fact, semidirect products correspond to a relatively simple kind of extension. 


Definition 12.4. An extension of Lie algebras 


0—> gı gs— g2 0 
is said to be split if there is a map s: g2 —> g such that q o s = id,,. If such a map exists, then its 
image s(g2) is a subalgebra of g which is isomorphic to gz and is complementary to g4, in the sense that 
g = gı @ g2 as vector spaces. In fact we have g = gi X s(g2) where s(g2) acts by derivations on g; via 
the adjoint action (since gj; is an ideal in g). 
Note that there may be many ways to split an exact sequence of Lie algebras (see the problem sheet). 


In characteristic zero, the extension of Lie algebras given by the rad(g) and g/rad(g) is always split: 


Theorem 12.5. (Levi's theorem) Let g be a finite dimensional Lie algebra over a field k of characteristic zero, and 
let x be its radical. Then there exists a semisimple subalgebra s of g such that g S t x s. 


Note that in particular s is isomorphic to g/rad(g). We will not prove this theorem in this course. 


206 Lie algebras rather than representations, but exactly the same definitions apply in this case also: the image of the 
incoming morphism is the kernel of the outgoing morphism. In particular the zeros at each end record the facts that the map 
from rad(g) to g is injective (since it is an inclusion) and the map from g to g/rad(g) is surjective (since it is a quotient map). 

?3Recall that the derivations of a Lie algebra are the linear maps a: h — b such that a((x, y]) = [a(x), y] + [x, a(y)]. 
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Remark 12.6. (More of a cultural aside.) Note that an extension does not a priori have to split! We saw 
this already for representations of nilpotent Lie algebras: If g = nz the one-dimensional Lie algebra, 
then its natural 2-dimensional representation on k? given by the inclusion nz > glz(k) gives a non-split 
extension 


0 ko ——> k? ko 0 
where ko is the trivial representation, and i: ko — k? is the inclusion t ++ (t,0). In fact it’s easy to see 
using linear algebra that for gl (k) = nz, an extension of one-dimensional representations ka and kg 
automatically splits if a 4 6 while there is, up to isomorphism, one non-split extension of ka with itself 


(a, 8 € (gh (k))*). 


13. CARTAN’S CRITERION FOR SEMISIMPLICITY 


The Killing form also gives us a way of detecting when a Lie algebra is semisimple. Recall that a 
bilinear form B: V x V —> kis said to be nondegenerate if {v € V : Yw € V, B(v,w) = 0} = {0}. To 
prove it we first note the following simple result. 

We have the following simple characterisation of semisimple Lie algebras. 


Lemma 13.1. A finite dimensional Lie algebra g is semisimple if and only if it does not contain any non-zero 
abelian ideals. 


Proof. Clearly if g contains an abelian ideal if contains a solvable ideal, so that rad(g) 4 0. Conversely, 
if s is a non-zero solvable ideal in g, then the last term in the derived series of s will be an abelian ideal 
of g (check this!). 


Theorem 13.2. A Lie algebra g is semisimple if and only if the Killing form is nondegenerate. 


Proof. Let gt = {x € g : k(x, y) = 0,Yy € g}. Then by Lemma 10.4 gt is an ideal in g, and clearly 
the restriction of «x to gt is zero, so by Cartan’s Criterion, and Lemma 10.3 the ideal gt is solvable. It 
follows that if g is semisimple we must have g+ = {0} and hence « is non-degenerate. 

Conversely, suppose that « is non-degenerate. To show that g is semisimple it is enough to show 
that any abelian ideal of g is trivial, thus suppose that a is an abelian ideal. Then if x € a and y € g 
is arbitrary, the composition ad(x)ad(y)ad(a) must be zero, since ad(y)ad(x)(z) € a for any z € g, as 
a is an ideal, and since a is abelian ad(a)(b) = 0 for all a,b € a. But then clearly (ad(y)ad(a))? = 0, 
so that ad(y)ad(z) is nilpotent and «(a,x) = 0 for all x € g. But then a C gt = {0} anda = {0} as 
required. 


14. SIMPLE AND SEMISIMPLE LIE ALGEBRAS 


Definition 14.1. We say that a Lie algebra is simple if it is non-Abelian and has no nontrivial proper 
ideal. We now show that this notion is closed related to our notion of a semisimple Lie algebra. 


Proposition 14.2. Let g be a semisimple Lie algebra, and let I be an ideal of g. Then g = I @ I+. 


Proof. Since g is semisimple, the Killing form is nondegenerate, hence (see for example notes on the 
course webpage) we have 


(14.1) dim(I) + dim(I+) = dim(g). 


Now consider I N I+. The Killing form of g vanishes identically on 7 N I E by definition, and since it is 
an ideal, the Killing form of IN I+ is just the restriction of the Killing form of g. It follows from Cartan’s 
Criterion that I N I+ is solvable, and hence since g is semisimple we must have J N J+ = 0. But then 
by Equation (14.1) we must have g = I 6 I+ as required (note that this is a direct sum of Lie algebras, 
since [7, I+] CIN IŻ). 


Proposition 14.3. Let g be a semisimple Lie algebra. 


(1) Any ideal and any quotient of g is semisimple. 
(2) Then there exist ideals 91, 92,...9x C g which are simple Lie algebras and for which the natural map: 


910 92 ©... O gk > Y, 


is an isomorphism. Moreover, any simple ideal a € g is equal to some g; (1 < i < k). In particular the 
decomposition above is unique up to reordering, and g = Dg. 
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Proof. For the first part, if I is an ideal of g, by the previous Proposition we have g = I @ I+, so that 
the Killing form of g restricted to I is nondegenerate. Since this is just the Killing form of J, Cartan’s 
criterion shows that J is semisimple. Moreover, clearly g/I = I+ so that any quotient of g is isomorphic 
to an ideal of g and hence is also semisimple. 

For the second part we use induction on the dimension of g. Let a be a minimal non-zero ideal in g. 
If a = g then g is simple, so we are done. Otherwise, we have dim(a) < dim(g). Then g = a @ a+, and 
by induction a+ is a direct sum of simple ideals, and hence clearly g is also. 

To show the moreover part, suppose that g = gi ® g2 ©... ® gx is a decomposition as above and a 
is a simple ideal of g. Now as 3(g) = {0}, we must have 0 ¥ [g, a] C a, and hence by simplicity of a it 
follows that [g, a] = a. But then we have 


k 
a= [g, a] = I o:a] = [gi,a] © [g2, a] Q...D [gx a], 


(the ideals [g;, a] are contained in g; so the last sum remains direct). But a is simple, so direct sum 
decomposition must have exactly one nonzero summand and we have a = [g;, a] for some i (1 < i < k). 
Finally, using the simplicity of g; we see that a = [g;, a] = g; as required. To see that g = Dg note that it 


is now enough to check it for simple Lie algebras, where it is clear™*. 


15. WEYL’S THEOREM 


In this section we assume that our field is algebraically closed of characteristic zero. 

In this section we study the representations of a semisimple Lie algebra. Recall from Section 4 
the definition of an irreducible representation. The Appendix in Section 23 reviews the other basic 
representation theory that we will need in this section. Our goal is to show, just as for representations 
of a finite group over C, that every representation is a direct sum of irreducibles. 

If (V,e) and (W, c) are representations of g, then a linear map ¢: V — W is a homomorphism of 
representations if 

$(o(2)(v)) = o(2)(o(v)) Vr eg, v eV. 
We will write Hom,(V,W) for the space of homomorphisms from V to W. To make the notation less 
cluttered, where there is no danger for confusion we will often suppress the notation for the map 
p: g — gl(V), so that, for example, the condition for ¢ to be a homomorphism will be written simply 
as 4(2(v)) = 2(d(v)). 

The following Lemma was on a problem sheet, and is proved in the appendix (the proof is identical 
to the corresponding results for representations of groups). For it we suppose that our field k is algeb- 
raically closed (but not necessarily of characteristic zero). 


Lemma 15.1. (Schur’s Lemma). Let (V, p) and (W, o) be irreducible representations of g. Then a g-homomorphism 
$: V > W is either zero or an isomorphism. Moreover, if k is algebraically closed, then Hom(V, W) is one- 
dimensional. 


Definition 15.2. We say that a representation V of a Lie algebra is semisimple if any subrepresentation 
has a complement, that is, if U is a subrepresentation of V then there is a subrepresentation W such 
that V =U ẹ W. 


Lemma 15.3. Let V be a representation of g. Then we have 


i) V is semisimple if any surjection of representations q: V — W has a right inverse. 
ii) If the representations of g are semisimple then they are completely reducible. 


Proof. For the first part, if U is a subrepresentation of V, then consider the quotient map q: V > V/U. 
If s is a right inverse for q, then we claim W = im(s) is a complement to U. Certainly if w € U N T the 
q(w) = 0, since U is the kernel of g. But then as w € T we have w = s(u) for some u € V/U, and hence 
u = q(s(u)) = q(w) = 0, so that w = s(0) = 0. Hence U and T form a direct sum, and since s is clearly 
injective, by dimension it U © T = V as required. 

For the second part, use induction on dim(V). If V is irreducible then we are clearly done, otherwise 
V has a proper subrepresentation U. But then U has a complement in V, say V = U @ T. But since 
dim(U), dim(T) < dim(V), they are completely reducible, hence V is completely reducible as required. 


24This is one reason for insisting simple Lie algebras are nonabelian. 
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Remark 15.4. In the problem sets it is checked that if a representation is semisimple then any subrepres- 
entation and any quotient representation of it is also semisimple. Knowing this, the above proof shows 
that if a representation is semisimple then it is completely reducible (whereas in the above we showed 
that if all representations of a Lie algebra g are semisimple, then they are all completely reducible.) 


Theorem 15.5. (Weyl’s Theorem). Let V be a finite dimensional representation of a semisimple Lie algebra over 
an algebraically closed field k of characteristic zero. Then V is semisimple, and hence completely reducible. 


Lemma 15.3 shows that the proof of Weyl’s theorem reduces to showing that if q: V > W is a 
surjective map of g-representations, then q has a right inverse s: W — V. (Such a map s is also often 
called a section of q, and we say that s splits the map q.) 


15.1. Casimir operators. Recall from Section 10 the definition of the trace form ty associated to a rep- 
resentation (V, p). 


Lemma 15.6. Suppose that g is semisimple and (V, p) is a representation of g. Then the radical of ty is precisely 
the kernel of p. In particular if (V, p) is faithful then ty is nondegenerate. 


Proof. The image p(g) C gl(V) of g is a semisimple Lie algebra (since g is) and the statement of the 
Lemma is exactly that ty is nondegenerate on p(g). But the radical t = rad(ty) is an ideal of p(g). Now 
Proposition 11.2 shows that if we let (D*t);,>o be the derived series of t, we must have D't!t Ç D*t 
whenever D*t # {0}, thus r must be solvable. Since p(g) is semisimple, this forces the radical to be 
zero as required. 


The next ingredient we need is the Casimir operator. Given (V,p) a representation, pick a basis 
{%1,@2,...,%n} of p(g), and let {y1, y2,..., Yn} be the dual basis of p(g) with respect to ty so that 
try (xiyj) = ĝi; (such a basis exists because the trace form ty for V thought of as a faithful representa- 
tion of p(g) is nondegenerate by the previous Lemma). Then let 


Cy = 2 p(x) p(y) 


Lemma 15.7. Suppose that g is a semisimple Lie algebra and V is a representation of g. The element Cy € gl(V) 
is an endomorphism of (V, p) which does not depend on the choice of basis {x1,...,%n}. Moreover tr(Cy) = 


dim(p(g)). 


Proof. The statement of the Lemma only uses the image of g in gl(V), so we may assume that g is a 
subalgebra of gl(V). Let z € g and consider [Cy , z]. Then we have: 


n 
[Cv,z] = 5 LEYiZ — ZTiyi 
i=1 
n 
= $ millu 2] + zy) — (2, 26] + wiz) 
i=1 


= X milu z] — [z zilvi 
=r 


Now if [y, 2] = Yi aijy;, using the fact that the bases {x;}/_, and {y;}7_, are dual we see that 
aij = ty([y:, z], xj) and hence since ty is invariant this is equal to ty (y:, [z, 2;]), which in turn is equal 
to bj; where [z, xj] = X ;—1 bjixi. It follows immediately that [Cy , z] = 0 as required. 

Finally, note that: 


tr(Cy) = (> Liyi) = ys tr(ziyi) = oS 1 = dim(p(g)) 


Remark 15.8. It is straight-forward to check that the Casimir is independent of the choice of basis 
{£1,..., £n} of g. Indeed if {z1,...,z,} is a different basis, then we may write z; = Say Cj; for 
some scalars c;;. Then if {t;}1<i<n is the dual basis for {z;}1<i<n and we write t; = er dizy; then 
setting C = (cij) and D = (d;;j) the condition that {t;}/_, is the basis dual {z;}/_, is precisely that 
Dt = C~!. and this immediately implies that >", p(zi)p(ti) = X; p(vi) p(y). 
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Recall that if (V, p) is a representation of g, then V! = {v € V : p(x)(v) = 0, Va € g} is the subrep- 
resentation of invariants in V. We also will need g.V = span{p(x)(v) : x € g,v € V}. Itis clearly a 
subrepresentation of V. 


Lemma 15.9. Let (V, p) be representation of a semisimple Lie algebra g. Then V = V! @g.V. Moreover, if 
q: V > W isa surjective homomorphism, then q(V8) = WS. 


Proof. We prove the statement by induction on dim(V) (the case dim(V) = 0 being trivial). If V = V8 
the certainly g.V = {0} and the statement holds. Thus we may assume that V 4 V”, so that p(g) 4 {0}. 
Let Cy be the Casimir operator of V. Since it is a g-endomorphism, if V = @ Vy is the decomposition 
of V into the generalised eigenspaces of Cy, each V; is a subrepresentations of V. Since if the statement 
of the Lemma holds for representations U and W it certainly holds for their direct sum U © W, we are 
done by induction unless Cy has exactly one generalised eigenspace, i.e. V = Vy. But then dim(V).A = 
tr(CV) = dim(p(g)), so that A 4 0, and hence Cy is invertible. Since it is clear from the definition of Cy 
that V! C ker(C\) we see that V? = {0}, and moreover V = Cy (V) C p(g)(p(g)(V)), so that V = g.V, 
and we are done. 

For the last part, note that if q: V — W is a homomorphism of representations, it is clear that 
q(V2) C W! and q(g.V) C g.W. It follows that if q is surjective we must have both containments being 
equalities. 


Theorem 15.10. (Weyl’s theorem) Let g be a semisimple Lie algebra, and let q: V — W be a surjective g- 
homomorphism. Then q has a right inverse, and hence every g-representation is semisimple. 


Proof. Consider the natural inclusion 
qx: Hom(W, V) > Hom(W, W), 


given by s ++ qo s. It is easy to see that q, is a surjective map of representations of g. Now clearly 
Hom(W,W) contains a copy of the trivial representation of g, given by the scalar multiples of the 
identity. It follows from Lemma 15.9 that Hom(W, V) must have a copy of the trivial representation 
mapping to this subrepresentation. But this means there is a g-invariant linear map s: W — V such 
that g..(s) = qos = idw, i.e. the map q has a splitting. The fact that V is semisimple now follows from 
Lemma 15.3. 


Remark 15.11. The complete reducibility of representation of a semisimple Lie algebra also now follows 
from Lemma 15.3. In fact, though we do not need it, a completely reducible representation is always 
semisimple (see the appendix on representation theory) so the apparently stronger statement that any 
g-representation is semisimple is actually equivalent to the statement that any g-representation is com- 
pletely reducible. 


16. THE JORDAN DECOMPOSITION 


If V is a vector space and x € End(V), then we have the natural direct sum decomposition of V into 
the generalized eigenspaces of x. This can be viewed as giving a decomposition of the endomorphism 
x in a semisimple (or diagonalisable) and nilpotent part, as the next Lemmas show. 


Lemma 16.1. If x,y € End(V) are commuting linear maps then if both are nilpotent, so is x + y, and similarly 
if both are semisimple, so is x + y. 


Proof. For semisimple linear maps this follows from the fact that if s is a semisimple linear map, its 
restriction to any invariant subspace is again semisimple. For nilpotent linear maps it follows because 


n z 
wear È (i)er 
k=0 


so that if n is large enough (e.g. n > 2dim(V)) each of these terms will be zero (since x and y are 
nilpotent). 


Proposition 16.2. Let V be a finite dimensional vector space x € End(V). Then we may write x = £s + £n 
where xs where xs is semisimple and x,, is nilpotent, and x, and x», commute, i.e. |£s, £n] = 0. Moreover, this 
decomposition is unique, and if U is a subspace of V preserved by x, it is also preserved by £s, Xn 
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Proof. Let V = yep Va be the generalised eigenspace decomposition of V, and let pa: V —> Vy be the 
projection with with kernel @ nen Va. If we set x, to be X`, A.py, clearly x, and x commute, and their 
difference x, = x — x, is nilpotent. This establishes the existence of the Jordan decomposition. 

To see the uniqueness, suppose that x = s+n is another such decomposition. Now since s commutes 
with z, it must preserve the generalised eigenspaces of x, and so, since x, is just a scalar on each Vy, 
clearly s commutes with x. It follows s and n both commute with x, and zn. But then by Lemma 16.1 
£s — sand n — x, are semisimple and nilpotent respectively. Since s + n = £s + £n they are equal, and 
the only endomorphism which is both semisimple and nilpotent is zero, thus s = x, and n = £n as 
required. 

Finally, to see that x, and £n preserve any subspace U which is preserved by x, note that if U = 
ier Ua is the decomposition of U into generalised eigenspaces of x, then clearly Uy C Vy, (VA € k) 
and since x, is a scalar on V; it certainly preserves Uy, and hence all of U. As £n = x — x, clearly £n 
also preserves U. 


Lemma 16.3. Let V be a vector space and x € End(V). If x is semisimple then 
ad(x): End(V) — End(V) 


is also semisimple, and similarly if x is nilpotent. In particular, if £ = xs + &n is the Jordan decomposition of 
x, then ad(x) = ad(x,) + ad(xn) is the Jordan decomposition of ad(x). In other words, ad(x), = ad(xs) and 
ad(£)n = ad(Xp). 


Proof. If x is semisimple, then let {e; : 1 < i < n} be a basis of eigenvectors, so that x(e;) = Aje; say 
(thus we are not assuming the A; are distinct here). It is easy to check that if e;; denote the matrix 
coefficients with respect to this basis (i.e. e;;(v) = ež (v)e; where {ež : 1 < i < n} is the dual basis of 
{ei : 1 < i < n}) then ad(x)(e;;) = (Ai — Aj )exj, so that {e;;,1 < i, j < n} is a basis of eigenvectors for 
ad(a) and hence ad() is semisimple. 

If x is nilpotent, then ad(z) = A, — pz where àz and py denote left and right multiplication by zx. 
Since A; and —p, are clearly nilpotent if x is, and evidently commute, ad(x) is nilpotent by Lemma 
16.1. 


We now return to Lie algebras. The above linear algebra allows us to define an “abstract” Jordan 
decomposition for the elements of any Lie algebra (over an algebraically closed field). 


Definition 16.4. Suppose that g is a Lie algebra and x € g. The endomorphism ad(x) € gl(g) has a 
unique Jordan decomposition ad(x) = ad(x), + ad(z), in gl(g). Then if s,n € g are such that ad(s) = 
ad(z), and ad(n) = ad(z)n, we say the Lie algebra elements s,n are an abstract Jordan decomposition of 
ax. 


Note that that if g = g!(V) for some vector space V, then Lemma 16.3 shows that the abstract Jordan 
decomposition for an element x € glI(V) is just the naive one (i.e. the one for x thought of as a linear 
map from V to itself). 


16.1. The abstract Jordan decomposition and derivations of semisimple Lie algebras. Unless expli- 
citly stated to the contrary, in this section we work over a field k which is algebraically closed of characteristic 
zero. 

Let g be a Lie algebra. Then Der,(g) the Lie algebra of k-derivations of g is a Lie algebra, which 
we may view as a subalgebra of the Lie algebra gI(g). The map ad: g > gl(g) is in fact a Lie algebra 
homomorphism from g into Der;(g). Its image is denoted Inn,(g). 


Lemma 16.5. Let a be a semisimple Lie algebra. 
(1) Suppose that a is an ideal of a Lie algebra g. Then there is a unique ideal I in g such that g is the direct 
sum of ideals a @ I. 
(2) All derivations of a are inner, that is, Der,(a) = Inn,(a). 


Proof. For the first part, consider the Killing form « of g. Let x be the Killing form of g and let at = 
{x € g : K(a,a) = 0,Va € a}, then a+ is an ideal in g. Now aN at is an ideal of g on which the 
Killing form vanishes, so that by Cartan’s Criterion” it is solvable. But since it is also an ideal of the 
semisimple Lie algebra a, it follows that aN a+ = {0}. Now since « is nondegenerate on a, we also have 


Again we use the fact that the Killing form on an ideal is the restriction of the Killing form for the whole Lie algebra. 
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dim(a) + dim(at) = dim(g) so that g = a @ at as required”. For uniqueness, note that if g =a@ I, isa 
direct sum of ideals, then [a, 7] = 0 and so clearly J C at. 

For the second part, note that the Lie algebra of derivations D = Der,;(a) is a subalgebra of gl(a) 
containing the image J of ad as the subalgebra of “inner derivations” which, since it is isomorphic to 
a, is semisimple. We first claim that this subalgebra is an ideal: indeed if ad(x) is any inner derivation, 
and 6 € D, then 


lô, ad (x)](y) = d[x, y] — [x, 6(y)] 
= [d(x), y] 
= ad(ô(x)) (y). 
thus [ô,ad(x)] € I, and hence I is an ideal in D. Now since I is semisimple, by the first part we see 
that D = I @ I+, thus it is enough to prove that J+ = {0}. Thus suppose that 6 € I+. Then since 
[I, I+] c IN I+ = {0} we see that 
[ô,ad(x)] = ad(d(x)) = 0, Vx € a, 
so that, again by the injectivity of ad, we have ô = 0 and so I+ = {0} as required. 


Lemma 16.6. Let a be a Lie algebra and Der,(a) C gl(a) the Lie algebra of k-derivations on a. Let 6 € Der,(a). 
If 5 = s + nis the Jordan decomposition of ô as an element of g\(a), then s,n € Der,(a). 


Proof. We may decompose a = @, a, where a) is the generalized eigenspace of x with eigenvalue 
A € ksay. If x € ay and y € a,, then an easy induction show that 
n L n r HaT 
6- atale =D (7) -A0 - wr 
r=0 
hence [x,y] € 4)+,. It follows immediately that s is a derivation on a, and since n = ô — s we see that 
n is also. 


Theorem 16.7. Let g be a semisimple Lie algebra. Then given any x € g has an abstract Jordan decomposition: 
that is, there exist unique elements s,n € g such that x = s + n and |s, n] = 0, and ad(s) is semisimple, while 
ad(n) is nilpotent. 


Proof. As noted above, since g is semisimple, ad: g + gl(g) is an embedding, and the conditions on s 
and n show that if they exist, they must satisfy ad (s) = ad(x), and ad(n) = ad(x)n, where ad(x) = 
ad(x), + ad(z),, is the Jordan decomposition of ad(x) € gl(g). Thus it remains to show that ad(z), 
and ad(a),, lie in the image of ad. But ad(x) acts as a derivation on I = im(ad), so by Lemma 16.6 
so do ad(x), and ad(zx),,. But then by Lemma 16.5, we see that ad(x), = ad(s) for some s € g and 
ad(x)n = ad(n) for some n € g. The conditions on s,n € g then follow from the injectivity of ad, and 
we are done. 


Remark 16.8. One can show that the Jordan decomposition is compatible with representations, in the 
sense that if (V, p) is a representation of a semsimple Lie algebra g and x = s + n is the Jordan decom- 
position of x € g, then p(x) = p(s) + p(n) is the (naive) Jordan decomposition of p(x) € gl(V). The 
main point, of course, is to show that p(s) and p(n) are, respectively, semisimple and nilpotent endo- 
morphisms of V. The proof, which we shall not give here, is similar to the proof of the existence of the 
abstract Jordan form, except that one also needs to use Weyl’s theorem (which we will prove later in 
the course!) 


17. THE CARTAN DECOMPOSITION OF A SEMISIMPLE LIE ALGEBRA 


In this section we work over an algebraically closed field k of characteristic zero. 

Although the Cartan decomposition makes sense in any Lie algebra, we will now restrict attention 
to semisimple Lie algebras g, where we can give much more precise information about the structure of 
the root spaces than in the general case. 


Proposition 17.1. Suppose that g is a semisimple Lie algebra and is a Cartan subalgebra. 


(1) Let « be the Killing form. Then &(gx, gu) = 0 unless A + u = 0. 
(2) Ifa € ®the-ae ®. 


26To see the claim about dimensions, note that « gives a map 0: g —> a*, where 6(x)(a) = (x,a) for x E€ g,a € a. Clearly 
ker(9) = a+. Since « is non-degenerate on a, the map @ is injective when restricted to a, and hence it is surjective. But then it’s 
kernel has the required dimension. 
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(3) The restriction of x to h is nondegenerate. 


Proof. For the first part, from the previous Lemma we know that [g,, gu] © gx+p. Thus if £ € g), y € 
Gu, We see that ad(x)ad(y)(g.) E gà+u+v. But then picking a basis of g compatible with the Cartan 
decomposition it is clear the matrix of ad (x)ad (y) will have no non-zero diagonal entry unless åA+4 = 0, 
hence x(x, y) = 0 unless A + u = 0 as required. 

For the second part, recall that if a is a root, then a Æ 0 and ga # 0. If —a ¢ © then g-a = 0 and so 
gt = g, which is impossible since « is nondegenerate. 

For the third part note that h+ contains all the ga for a € © by part (1). Since « is nondegenerate, by 
dimension counting this must be equal to h+. It follows that xj must be nondegenerate as claimed. 


Lemma 17.2. Let g be a semisimple Lie algebra and h a Cartan subalgebra. Then h is abelian. Moreover, all the 
elements of h are semisimple. 


Proof. We show that [b, b] = Dh = 0. By part (3) of Proposition 17.1 it is enough to show that Db lies in 
the radical of xip. Suppose that x, y € h. Then we claim that: 


(17.1) w(x, y) = tr(ad(x)ad(y)) = $ dim(gy)A(z)A(y). 
rE 

Indeed each g, (for A € ® U {0}) is an h-subrepresentation, so we may compute the trace of ad(x)ad(y) 
on each in turn and then add the resulting expressions. Now § is solvable, so we may find a basis for 
ga with respect to which each ad(z) has an upper triangular matrix, and since ad(x) has the unique 
eigenvalue A(x) on gy, the diagonal entries of the matrix must all equal A(x). It follows that the matrix 
of ad(x)ad(y) is upper triangular with diagonal entries \(x)A(y), and so summing over A € ®U {0} we 
obtain Equation (17.1). Now åA is a one-dimensional representation of h, so it vanishes on Db. It is then 
immediate from Equation (17.1) that x(x, y) = 0, and so x € rad (sip) = {0} as claimed. 

Next suppose that x € h has Jordan decomposition x = s + n in g. Now ad(s),ad(n) must preserve 
h since ad(x) does, and thus since b is self-normalising, we see that s,n € h. Thus it is enough to show 
that h contains no nilpotent elements. But if n € h is nilpotent, then ad(n) is nilpotent on g, and hence 
on each gy. But then 0 = tr(ad(n)),,) = dim(g,)A(n), so that A(n) = 0. But then we see from Equation 
(17.1) that n is in the radical of xp, so n = 0 as required. 


Remark 17.3. Since the restriction of «x to h is non-degenerate, it yields an isomorphism from b* to b, 
indeed if À € h* then there is a unique t, € h such that «(t), y) = A(y) for all y € b, and the assignment 
A > ty is clearly linear. (See the notes on bilinear forms for more details.) 


Remark 17.4. Some textbooks study semisimple Lie algebras g via maximal toral subalgebras. These are 
subalgebras of g consisting entirely of semisimple elements, maximal with this property. It follows 
readily from the above Lemma and the Cartan decomposition that Cartan subalgebras of semisimple 
Lie algebras are maximal toral subalgebras, though we will not use this fact. 


Proposition 17.5. Let g be a semisimple Lie algebra and let h be a Cartan subalgebra, and g = b Paca Ga the 
associated Cartan decomposition. 


(1) [ft € ga, y E gg and h € h then 
K(h, [a, y]) = a(h)K(a, y). 
(2) The roots a € ® span b*. 
(3) The subspace ha = [Sa;9—a] C b is one-dimensional and a(ha) 4 0. 


(4) Ifa € ®, we may find ea € Ga, fa E€ Ga and ha €E hq so that the map e œ ea, f œ> fa and h> ha 
gives an embedding slz > ga ® ba © 9-a- 


Proof. For (1) we have 


n(h, [x,y]) = K((h, x], y) = K(a(h)a,y) = alh)e(x, y), 
as required. 

For (2), suppose that W = span{®}. If W is a proper subspace of h*, then we may find an h € h such 
that a(h) = 0 for all a € ®. But then it follows from our formula for the Killing form in terms of the 
Cartan decomposition that (h, x) = 0 for all x € h, which contradicts the nondegeneracy of the form 
Kip: 

For (3), as in the remark above, since «jy is nondegenerate it yields an isomorphism h* — b, given 
by A+ ty where (t), h) = A(h) for all h € b. Since we know that ® spans h", it follows that {ta : a € ®} 
spans h. Suppose that £ € ga, y E 9-a. Then by (1) we see that |x, y] = s(x, y)ta, so that ha C span{ta}. 
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Since « is nondegenerate on ga © g-a we may find x E€ ga,y E g—a such that «(z,y) # 0, hence 
ha = span{t,} as required. 

Next we wish to show that a(h.) 4 0. For this note that if a(h.) = 0 then pick x € ga, Y E g-a SO 
that z = [x,y] € bq is nonzero. Then |z, x] = a(z)z = 0 = —a(z)y = [z, y], so that a = k-span{z, y, z} 
is a solvable subalgebra of g. In particular, by Lie’s theorem we may find a basis of g with respect to 
which the matrices of ad(a) act by upper triangular matrices, and so ad(z) = ad (|x, y]) acts by a strictly 
upper triangular matrix, and hence is nilpotent. Since we also know z € h we have ad(z) is semisimple, 
hence ad(z) is both semisimple and nilpotent, which implies it is zero, contradicting z # 0. 

Given a(ba) # 0, it is clear that there is a unique ha E€ ba such that a(ha) = 2. Next if ea € ga is 
nonzero, then using the nondegeneracy of « and part (1) we can pick fa E€ g-a so that [ey, fa] = ha It 
is easy to check that {ea, fa, ha} span an copy of slz in g which establishes (4). 


Remark 17.6. A triple of elements {e, f, h} in a Lie algebra g which obey the relations of the standard 
generators of sl» (that is, [e, f] = h, [h,e] = 2e, [h, f] = 2f) is called an slp-triple. Given a € ®, the copy 
of sl, attached to a in the previous proposition will be denoted sly. 


Lemma 17.7. Let g be a semisimple Lie algebra and h a Cartan subalgebra with Cartan decomposition g = 
b Daco Ga: Then 

e The root spaces gq are one-dimensional. 

e Ifa € and ca € Ẹ for some c € Z then c = +1. 


Proof. Choose a nonzero vector ex € ga- Then as above we may find an element e—a € g_q such that 
lea, €a] = Ra € b (since « restricted to ga © g—a is nondegenerate). Consider the subspace: 


M = k.ea @ kta ® ae Bie 
p<0 


(this is a finite direct sum as g is finite-dimensional). Then since ad(ea)(ea) = 0, and [ga, 9-a] = kha, 
and [ea, ha] = 2ea, it is easy to see that M is stable under ea, €—a and ha. We commute the trace of ha 
on M in two ways: on the one hand, it a commutator and so has trace zero. On the other hand it acts 
semisimply on each of the direct sums defining M, so that 


0 = tr(ha) = a(ha) + X dim(gpa)-Ppa(ha) 


= a(ha)(1 — X` p. dim(g-pa). 
p>0 


Since we know that a(ha) 4 0, the only way the above equality can hold is if dim(gpa) = 0 for p > 1 
and dim( (g-a) = 1. Since —a € © if and only if a € ©, this completes the proof. 


We can refine somewhat the structure of the Cartan decomposition we have already obtained, using 
the same techniques. Suppose that a, 8 are two roots in g such that 6 # ka for k € Z. Then we may 
consider the roots which have the form a + k£. Clearly, since g is finite dimensional, there are integers 
p,q > 0 such that a + k8 € ® for each k with —p < k < q, but neither —(p + 1)a nor (q + 1)a are not in 
®. This set of roots is called the a-string through £. 

Proposition 17.8. Let 6 — pa,..., 8B + qa be the a-string through 6. Then we have 
2k (te, ta) 
he) = K(ha, ts) = ——_* =p- q. 
Bite) — testa) Se ey Pe 
In particular 8 — B(hq).a € ®. Moreover, if a € ® and c € k has ca € ® then c € {£1}. 


Proof. We consider the subspace M = P-p<k<q Ja+kg- Pick ea € gq and e—a E g-a such that 0 + 
lea, €—a] = ha and so that {ea, €—a, Rha} form the standard generators of slz as above. It is clear that 
ea, ha, €—a preserve M, so we trim(ha) = 0, and so, using the fact root spaces are 1-dimensional, we 
have the identity: 
© (B+ ka)(ha) =0, 
—psk<q 
and so 
(alq + 1)/2 — p(p + 1)/2)a(Ra) + (p +q + 1)B(Ra) = 9, 
and so since p+q+1#0and a(ha) = 2, we obtain: 


Blha) =P -4. 
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as required. Since 8 + (p — q)a is certainly in the a-string through £ it follows that 6 + 6(ha).a € ®. 


For the second part, taking 8 = ca we see that 2c = p — q € Z, and soc = $(p — q). But now if p — q 


is even we can immediately conclude as before that c = +1. On the other hand, if p — q is odd, the 
a-string through 6 = Da has the form: 
-(p +4) (p— 4) (p+4q) 
5 Oss 5 Qpiess 5 Q, 


which clearly then contains $a so that $a € ®. But then we get a contradiction as a = 2($a). 


Remark 17.9. In fact, ifa,8 € ® and a +8 € ®, then [ga, 98] = ga+s. Indeed let slz act on g via the triple 
{ea; ha, €—a } as in the proof above. Then the slg-representation M = @_ p<k<q Goths is easily seen to 
be an irreducible representation (because each ha weight space is one-dimensional and the eigenvalues 
all have the same parity as (ha). Then the explicit description of the irreducible representations of sl2 
worked out in the problem sheet shows that ea is injective except on the subspace gg+qa, and so in 
particular on gg, hence [ga, gg] is nonzero and hence all of ga+,. 


18. h AND INNER PRODUCT SPACES: 


In this section we work over an algebraically closed field k of characteristic zero. 

Recall that since «jy is non-degenerate, it gives an isomorphism 0: h* — h. For A € b*, we write ¢) for 
6(A), so that K(t,, h) = A(h), (VA € b*, h € b). Given a root a € Ẹ, we have seen that ga D g-a ® [Gas 9-a] 
span a subalgebra isomorphic to slp. We will denote this subalgebra as sla. We note the following 
simple lemma. 


Lemma 18.1. Let a € ®. Then if {ea, fa, ha} are a standard basis for sla then we have 
(1) 
ta 2ta 


ha = — 2, h= 
Klas fa) 


(taste) 
Moreover k(te,ta)-K(Rha; ha) = 4. 
(2) Ifa, 8 € ® then k(ha, hg) € Zand K(ta,tg) E Q. 


Proof. The first equality follows immediately from the results of the previous lecture. For the second, 
note that 
2 = a(ha) = K(ta, ha) = K(ta, Klea, falta) = Klta, ta)K (ea; fa), 
and substitute into the first expression. The last expression follows from calculating K(hq, ha) using 
the second expression. 
Using the Cartan decomposition to compute «(z,y) for x,y € h we see that (since we now know 
that root spaces are one-dimensional) 


K(Ras he) = >> y(a)7(8) € Z. 


yE 


Finally, the first part of the Lemma now immediately gives (ta, tg) € Q. 


Let (—, —) denote the bilinear form on h* which is obtained by identifying h* with b: that is 


(A, u) = (ta, ty): 
Clearly it is a nondegenerate symmetric bilinear form, and via the previous Lemma, (a, 3) = K(ta,tg) € 
Q for alla, 6 € ®. 


Lemma 18.2. The Q-span of the roots ® is a Q-vector space of dimension dim,(h*). 


Proof. We know that ® spans h*, so we may pick a subset {a 1, @2,...,a,} which forms a k-basis of h*. 
To prove the Lemma it is enough to show that every 3 € © lies in the Q-span of the {a; : 1 <i < J}. 
But now if we write 3 = Da c;a; for cj € k, then we see that (a;, 8) = Di (ai a,;)c;. But the matrix 
C = (ai, a;);,; is invertible since (—, —) is nondegenerate, and its entries are in Q hence so are those of 
C—'. But then we have (c;) = C~'((a;, 8)), and the objects on the right-hand side all have Q-entries, 
so we are done. 


Let 5G denote the Q-span of the roots. Although you are perhaps more used to inner product spaces 
over R or C, the definition of a positive definite symmetric bilinear form makes perfectly good sense 
over Q. We know show that (—, —) is such an inner product on ho. 


Lemma 18.3. The form (—, —) is positive definite on ho. 
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Proof. Using the root space decomposition to compute «x we have 
(A, A) = K(ta,ta) = X alta)? = So (a, A)? > 0, 
ac ac 


and since we may have equality if and only if (a, A) = 0 for all a € ®, and the elements of ® span h* it 
follows that the form is definite as required. 


19. BASES FOR ROOT SYSTEMS 


In this section we study the geometry which we are led to by the configuration of roots associated to 
a Cartan decomposition of a semisimple Lie algebra. These configurations will turn out to have a very 
special, highly symmetric, form which allows them to be completely classified. 

We will work with an inner product space”, that is a vector space equipped with a positive definite 
symmetric bilinear form (—,—). Such a form makes sense over any field which has a notion of positive 
elements, and so in particular over R and Q. Since the roots ® associated to a Cartan decomposition 
of a semisimple Lie algebra naturally live in the Q-inner product space ho, we will assume our field is 
Q unless otherwise stated. We let O(V) denote the group of orthogonal linear transformations of V, 
that is the linear transformations which preserve the inner product, so that g € O(V) precisely when 
v,w E V then (v, w) = (g(v), g(w)) for allv,w € V. 


Definition 19.1. A reflection is a nontrivial element of O(V) which fixes a subspace of codimension 1 
(ie. dimension dim(V) — 1). If s € O(V) is a reflection and W < V is the +1-eigenspace, then L = W+ 
is a line preserved by s, hence the restriction s);, of s to L is an element of O(L) = {+1}, which since s 
is nontrivial must be —1. In particular s has order 2. If v is any nonzero element of L then it is easy to 
check that s is given by 


2 

(u,v) 

(v, v) 

Given v # 0 we will write s, for the reflection given by the above formula, and refer to it as the 
“reflection in the hyperplane perpendicular to v”. 


s(u)=u-— 


We now give the definition which captures the geometry of the root of a semisimple Lie algebra. 


Definition 19.2. Let V be a Q-vector space equipped with an inner product (—,—). A finite subset 
® C V\{0} is called a root system if it satisfies the following properties. 

(1) ® spans V; 

(2) Ifa € @ then ca € Ẹ if and only if c = +1; 

(3) Ifa € @ then sa: V — V preserves ®; 

(4) Ifa, 6 € ® then 2(8,a)/(a,a) € Z. 


If (—, —) denotes the inner product on 6g, then we have shown that the roots in h* form a root system 
in the above sense: one simply has to translate the information about the elements ta € h obtained in 
our analysis of the Cartan decomposition: the crucial fact is that, for the roots arising from a semisimple 
Lie algebra we have 


2(6, a) 2ta 


CO Mar 
We established that 6(h,) € Z using root strings, and moreover the properties of root strings also 
showed that sa(G) = 6 — B(ha).a E€ ®, hence properties 3) and 4) hold. Since 1) and 2) were also 
explicitly checked, we see that the roots ® in hg do indeed form a root system. 
Remarkably, the finite set of vectors given by a root system has both a rich enough structure that 
it captures the isomorphism type of a semisimple Lie algebra, but is also explicit enough that we can 
completely classify them, and hence classify semisimple Lie algebras. 


= B(hg) € Z. 


Definition 19.3. Let (V, ®) be a root system. Then the Weyl group of the root system is the group 
W = (sa : a € ®). Since its generators preserve the finite set @ and these vectors span V, it follows 
that it is a finite subgroup of O(V). 


27That is, one with a notion of distance and angle. Apart from working over Q rather than R, this is pretty much the vector 
geometry of Geometry I. 
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Example 19.4. Let g = sln, Then let 0,, denote the diagonal matrices in gl, and h the (traceless) diagonal 
matrices in sl,,. As you saw in the problem sets, h forms a Cartan subalgebra in sI,. Let {e; : 1 <i < n} 
be the basis of 0* dual to the basis {F;; : 1 < i < n} of dn in gl,,. Then ho is the quotient space 


n 
bo = {> ciei : ci € Q}/{Q.(e1 +... + En}, 
i=1 
the roots in hġ are the (images of the) vectors {e; — £; : 1 < i,j < n,i # j}. The Weyl group W in this 
case is the group generated by the reflections sa which, for a = £; — £; interchange the basis vectors ¢; 
and ¢j, so it is easy to see that W is just the symmetric group on n letters. 


A key step in the classification process is to find a good basis for V: we assume that © spans V, so 
certainly we may find a subset of 6 which is a basis, but it turns out we can find a class of particularly 
well adapted bases. 


Definition 19.5. Let (V, ®) be a root system, and let A be a subset of ®. We say that A is a base for ® if 
(1) Aisa basis for V. 
(2) Every 6 € A can be written as > 
sign. 


acA Ca Where ca €E Z and the non-zero cas all have the same 


Given a base of ® we may declare a root positive or negative according to the sign of the nonzero 
coefficients which occur when we write it in terms of the base. We write ®* for the set of positive roots 
and -` for the set of negative roots. 


We now show that any root system has a base exist. The first crucial point is that the angle between 
roots are very constrained. For convenience we will write 


(19.1) aaa ee 


(a, @) 
which we call a Cartan integer. 


Lemma 19.6. Let (V,®) be a root system and let a,8 € ® be such that a + +6. Then (a,8)(B,a) € 
{0,1,2,3}. It follows that the angle between two such roots a, 8 lies in the set 


{a /2, 7/3, 20/3, 1/4, 30/4, 1/6, 57/6}. 
Moreover, the ratios of root lengths which are not perpendicular must be 1,2, 1/2, 3 or 1/3. 


Proof. By assumption, we know that both (a, 6) and (8, aœ) are integers. On the other hand, by the 
cosine formula (i.e. by Cauchy-Schwarz) we see that if 0 denotes the angle between a and £, then: 
(19.2) (a, B) (B, a) = 4c08(0)? < 4. 


Since cos(6)? determines the angle between the two vectors (or rather the one which is at most 7) 
and (8,a)/(a, 8) = ||a||?/||,8||? (where we write ||v||? = (v,v)), the rest of the Lemma follows by a 
case-by-case check as we see from the following table: 


(ab) | (Ba) | 6 | _llall2/IIBI12 

0 0 a/2 | undetermined 
1 1 1/3 1 

af | 1 208 I 
I 2 | «/4 2 

—1 | —2 | 30/4 2 
1 3 1/6 3 

1 | —3 | 51/6 3 


We now show that any root system must possess a base. In fact we show a little more. If v € V, we 
say that v is regular” if (v, a) 4 0 for all a € ©. Since V is nota finite union of proper subspaces, regular 
vectors certainly exist. Given such a v, write ®*(v) = {a € ©: (v,a) > 0}, and @ (v) = -8+ (v). 
Clearly ® = t (v) |] (v). We say a root a € ®*(v) is decomposable if we can find roots a; € &*(v) 
and nonnegative integers n;, (1 < i < s, s > 2), such that a = X`}; nia;. Let A(v) be the set of 
indecomposable roots, i.e. the roots which are not decomposable, in ®t (v). 


28In the lecture I used f € V*. Since any such f is of the form (v, —) for some v € V this makes no difference, and since later 
I switched to thinking of f in that form, I’m writing everything in terms of v here for simplicity. 
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Proposition 19.7. Let (V, ®) be a root system, and let v € V be a regular vector. Then A(v) is a base for (V, ®) 
and every base is of this form. 


Proof. Let A(v) be the indecomposable roots in ®*(v) as above. We first show that every root in ®*(v) 
is a nonnegative integer combination of indecomposable roots. Indeed let m = min{(v,a) : a € P+}. 
Ify € @*(v), and we have y = X`} nia; where ni € Z>o and a; E€ &*(v), then clearly (v,y) > 
5 nim, hence for any such expression the sum 5>;_, n; is bounded above by (v, a)/m. If we write 
y = J`; nia; where J`}; n; is maximal possible, then we claim each q; is indecomposable. Indeed 
otherwise there is some i such that +; is decomposable, so that we may write y; = es m,Q; for some 
t > 2, positive integers mj, and 8; € &*(v), (1 < j < t). But then y = Xpy; niai + Zi mjnif; and 
ages Me + (©ia mj)ni > Jopi nk + 2ni > J) i- ni contradicting our maximality assumption. Since 
® also spans V, this shows that A( f) spans V, hence since ® = + (v) U —®*(v) in order to show that 
A(v) is a base it only remains to show that A (f) is linearly independent. 

We check this in a number of steps. 
Step 1: The angle between any two vectors a, 3 € A(v) is obtuse, that is (a, 3) < 0: To see this suppose 
that (a, 3) > 0. Then one can check” using the table from the proof of the Lemma from last time on 
the angles between roots that if we take (a, a) > (6, 8) then (8,a) = —1. But then sa(8) = 8 — a E€ Ẹ, 
so that either £ — a or a — £ lie in ®*, and hence either a or 8 will be decomposable. 
Step 2: Linear independence: Suppose that we have )7 <a p) Caa = 0 for ca € Q, (a € A). Gathering 
all the positive and nonpositive coefficients let 


z= E aa = D (-e9)8 


Ca >0 cg <0 


Then we have: 

(zz)= Š,  ca(—cp)(a, 8) < 0, 

a, B:Cq >0,c8 <0 

so by positive definiteness we must have z = 0. But since (v, z) = 2. >o Ca(v,@) = 0 with equality 
if and only if the sum is empty we conclude that for all 8 € A we have cg < 0, but then we have 
0 = )7.,<0(—¢s)8, and pairing with v again we deduce that for all 3 we have cg = 0, and so Ais a 
linearly independent set. 
Step 3: Finally we need to show that any base A is of the form A(v) for some regular v € V. Fora € A, 
let Py = {v € V : (v,a) > 0}. This is a half-space in V. It can be checked that Maea Pa is non-empty 
(indeed this holds for any linearly independent set in V, not just a base®’) so we may pick some v € V 
in this intersection. Then since any root is a nonpositive or nonnegative combination of the roots in 
A, it follows the vector v is regular, and moreover it is clear that P+ C + (v) the set of positive roots 
associated with A, and since ~ = —®*, we see similarly that #7 C ®- (v), and since Ẹ is the disjoint 
union of both + U ®~ and ®*(v) U ®(v) it follows that ®*(v) = +. Finally, since A(v) are the 
indecomposable roots in ®+ = + (v), we must certainly have A(v) C A, and since both are bases of V 
it follows that A = A(v) as required. 


Our proof shows that for a given root system one can always find a base, but it will not be unique: 
different choices of regular v can lead to different positive systems ®+ (v) and hence different bases. 
This ambiguity is however controlled by the action of the Weyl group, as we will now see. From 
now on fix a base A for our root system, and thus have the corresponding positive and negative roots 
+t, ®~. Let Wo be the subgroup of W generated by the reflections sa for a € A. The elements of A are 
called simple roots, and the corresponding reflections sa are called simple reflections. 


Lemma 19.8. Ifa € A, then the reflection sa preserves the set ®*\{a}. Moreover, if p = $ X acg+ o, then 
(p, 7) = (1/2)||7||? > 0 for all y € A and hence 6+ = + (p) 


Proof. Let y € &*\{a}. Since A is a base, we may write y = ` gea cgp where cg € Z>o forall 6 € A, 
and so 
Sal) = (ca—(a,7)Jat+ $ cpe. 
BEA, PFa 


>This either follows just inspecting the table, or by noting that by inequality proved there for a, 8 with 8 # +a we have 
(a, B).(8,a) < 4and so since (a, 3) and (8, a) are integers at least one must be +1. 

3O ust extend your linearly independent set to a basis, and take the dual basis with respect to the inner product- the inter- 
section of half-spaces then become the set of vectors whose coordinates with respect to the appropriate dual basis vectors are 
positive. 
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Now since y # +a (since it is positive) there must be some 69 € A\{a} with cg, > 0. But then by 
the definition of a base we must have sa(y) € ®*, and clearly sa(y) # a since this would imply 
Y = Sala) = —a € `. For the final part, note that if y € A we must have 


s= sts Da) 


aebt\{y} 
1 1 
Sag Dye 
aeht\{y} 
=p- 


now since by definition s,(p) = p — ZeD 7 the result follows immediately. The fact that P+ = + (p) 


(in the notation of Proposition 19.7 then follows immediately from the proof of that proposition. 


Definition 19.9. Given a root system equipped with a base A = {ay,...,a,} we may define a height 
function ht: V —> Q by setting ht(v) = $75, ci where v = J`; ciai. Note that it follows from the 
definition of a base that the height function is integer-valued on ®, and ht(8) = 1 if and only if 6 € A. 


Note that this height function is a special case of the height function used in the proof of Weyl’s 
theorem — if {w; : 1 < i < r} denotes the dual basis to the base A with respect to the inner product, 
then ht(8) = (6, 8) where ô = X ;—] wi. 


Proposition 19.10. Suppose that B € ®. Then there isa w € Wo and an a € A such that w(B) = a. 


Proof. First suppose that 8 € +. We prove the statement by induction on the height of 3. The state- 
ment being clear if 8 is of height 1, we may assume h({) > 0. We claim there is some y € A such that 
(8,7) > 0. If not then writing 8 = D7, cy7/, we see that 


(8,8) = $ (8,7) <0, 


yea 


so by positive definiteness we would have £ = 0. 

Now taking y with (6,7) > 0, we see that s.(3) € ®* using the previous Lemma (since £ is certainly 
not equal to a as h(3) > 1, and moreover h(s,(8)) = h(8) — (7,8) < h(8). Thus by induction we are 
done. Finally we need to consider the case 8 € ®~. But then we have —8 € ®*, and so there is a 
w € Wo such that w(—f) = y for some y € A. But then clearly (s,w)(@) = y and we are done. 


Corollary 19.11. The Weyl group W is generated by the reflections {s} : y € A}, that is W = Wo. 


Proof. If 8 € ® then we have just shown in the previous proposition that there is a w € Wo such that 
w(8) = 7 for some y € A. But the clearly sg = w~'s.,w € Wo, and so since W is generated by the ss 
we have W = Wo as required. 


Theorem 19.12. Suppose that A and A, are bases of (V, ®). Then there isa w € W such that w(A) = Aj. 


Proof. By Proposition 19.7 we know any base is determined by a regular element v € V: given v, the 
base it yields is the set of indecomposable roots in + (v). Thus it is enough to show that there is a 
w in W such that w(®7) = ®t, where ®t and * are the positive roots corresponding to A; and A 
respectively. 

Suppose v is a regular vector such that &+(v) = Pt. We claim that there is a w € W such that w(v) 
satisfies (w(v),a) > 0 for all a € A. This immediately implies that (w(v), a) > 0 for all a € ®+, so that 
+ = + (w(v)), and thus w(®}) = 9, since (w(v), a) > 0 if and only if (v, w7!(a)) > 0. 

To prove the claim, first consider p = $ >< a. By the Lemma 19.8 we know that s.(p) = p — a for 
all a € A. Now choose w € W such that (p, w(v)) is as large as possible. Then if a € A, we must have 


(w(v), p) > (Sa-w(v); p) = (wv), Sa(p)) = (we), p = a) = (wv), p) — (wv), a). 


Thus it follows that (w(v), a) > 0 for all a € A, and since (v, a) 4 0 for all a € ® (as f was assumed to 
be generic) the claim follows. 


Remark 19.13. In fact W acts simply transitively on the bases of (V, ©), that is if w(A) = A then w = 1. 
The proof (which we will not give) consists of examining the minimal length expression for w in terms 
of these generators {sq : a € A}. 
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20. CARTAN MATRICES AND DYNKIN DIAGRAMS 
In this section we describe the data which is used in the classification of semisimple Lie algebras. 


Definition 20.1. Let (V, ®) be a root system. The Cartan matrix associated to (V, ®) is the matrix 


C = ((au, aj) pa 
where {a1, @2,...,a¢} = A is a base of (V, ®). Since the elements of W are isometries, and W acts trans- 
itively on bases of Ẹ, the Cartan matrix is independent of the choice of base (though clearly determined 
only up to reordering the base A). 


Definition 20.2. The entries c;; of the Cartan matrix are all integer with diagonal entries equal to 2, 
and off-diagonal entries c;; € {0,—1, —2, —3} (where i 4 j) such that if ci; < —1 then cj; = —1 so that 
the pair {c;;,c;;} is determined by the product c;;.c;; and the relative lengths of the two roots (e.g. see 
the table in the Lemma about angles between roots). As a result, the matrix can be recorded as a kind 
of graph: the vertex set of the graph is labelled by the base {a1,...,a;}, and one puts (a;, a;).(a;, a) 
edges between a; and aj, directing the edges so that they go from the larger root to the smaller root. 
Thus for example if (a;i, aj) = —2 and (aj, ai) = —1 so that ||a;||? > ||a;||?, that is, a; is longer than aj, 
we record this in the graph as: 
Xi e <—— @% 
The resulting graph is called the Dynkin diagram. 


For the next theorem we need to formulate what it means to have an isomorphism of root systems. 
This is given in the natural way: if (V,®) and (V’, ®’) are root systems, a linear map ¢: V > V’ is an 
isomorphism of root systems if 

(1) The map ¢ is an isomorphism of vector spaces. 

(2) ¢(®) = P’, and (a, 8) = (¢(a), ¢(8)) for all a, 8 € ®. 
Note that ¢ need not be an isometry (e.g. we could scale V by a nonzero constant c € Q to obtain (V, c®) 
a distinct, but isomorphic root system to (V, ®). 


Theorem 20.3. Let (V, ©) be a root system. Then (V, ®) is determined up to isomorphism by the Cartan matrix, 
or Dynkin diagram associated to it. 


Proof. Given root systems (V,®) and (V’, ®’) with the same Cartan matrix, we may certainly pick a 
base A = {ay,..., ae} of (V, ®) and a base A’ = {(),..., Be} of (V’, ®’) such that (a;i, aj) = (Bi, Bj) for 
all i,j, (1 < i,j < £). We claim the map ¢: A > A’ given by ¢(a;) = 8; extends to an isomorphism 
of root systems. Clearly, since A and A’ are bases of V and V’ respectively, ¢ extends uniquely to an 
isomorphism of vector spaces ¢: V — V’, so we must show that ¢(®) = ©’, and (¢(a), o(8)) = (a, 8) 
for each a, 8 € ®. 

Let si = Sq, E€ O(V) and s; = sg, € O(V’) be the reflections in the Weyl groups W and W” respect- 
ively. Then from the formula for the action of s; it is clear that ¢(s;(a;)) = s4(8;) = s4(@(a;)), so since 
A is a basis it follows $(s;(v)) = s;(¢(v)) for all v € V. But then since the s;s and s/s generate W and 
W’ respectively, ¢ induces an isomorphism W —> W’, given by w +> w' = ġ o w o ¢7!. But then given 
any a € ® we know there is a w € W such that a = w(a;) for some j, (1 < j < &. Thus we have 
la) = ¢(w(a;)) = w'(d(a;)) = w'(B;) E P, so that ¢(@) C ®’. Clearly the same argument applied to 
@ ' shows that ¢~1(®’) C ® so that (®) = ©’. 

Finally, note that it is clear from the linearity of ¢ and of (a, y} in the second variable, that (a, y) = 
(o(a), o(y)) for all a € A, y € ©. In the same fashion as above however, if a € @ is arbitrary, then we 
may find w € W such that a = w(a;) € A, and thus ¢(a) = w'(6;), whence we have 


(Hla), (7) = (w (Bj), (7) = Bi wT A) 
= (aj, w~*(7)) = (w(az),7) = (2,7). 


as required. 


Thus to classify root systems up to isomorphism it is enough to classify Cartan matrices (or Dynkin 
diagrams). 


Definition 20.4. We say that a root system (V, ®) is reducible if there is a partition of the roots into two 
non-empty subsets ®; L ®2 such that (a, 6) = 0 for all a € ®),6 € ®2. Then if we set Vi = span(®;) 
and V2 = span(®z), clearly V = V; ® V2 and we say (V, ®) is the sum of the root systems (V;, ®;) and 
(V2, ®2). This allows one to reduce the classification of root systems to the classification of irreducible 
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root systems, i,e. root systems which are not reducible. It is straight-forward to check that a root system 
is irreducible if and only if its associated Dynkin diagram is connected. 


Definition 20.5. (Not examinable.) The notion of a root system makes sense over the real, as well as 
rational, numbers. Let (V,®) be a real root system, and let A = {a1,a2,...,a;} be a base of ®. If 
vi = a;/||ai|| (1 < i < l) are the unit vectors in V corresponding to A, then they satisfy the conditions: 
(1) (vi, vi) = 1 for alli and (vi, v;) < Oifi Æ J, 
(2) If i A j then 4(v;, vj)? € {0,1,2,3}. (This is the reason we need to extend scalars to the real 
numbers — if you want you could just extend scalars to Q(v2, v3), but it makes no difference 
to the classification problem). 


Such a set of vectors is called an admissible set. 
It is straightforward to see that classifying Q-vector spaces with a basis which forms an admissible 
set is equivalent to classifying Cartan matrices, and using elementary techniques it is possible to show 


that that the following are the only possibilities (we list the Dynkin diagram, a description of the roots, 
and a choice of a base): 


e Type Ag (£ > 1): 


£ 


V={v= Soe cQ: 5 a=0}, = {e-e 1 St AG <4 
i=1 
A = {cip — E&i: 1<i<Ll-1} 


e Type Be (£ > 2): 


e-.-.-e=~e 


V=Q = {tete,:1<1,5< 21 Aj} Ula :1<i< 4, 
A = {61,641 —G: : 1 <i < l—1} 
e Type C2 (£ > 3): 


e-.-.-e<—~e 


V =Q, = {te te; :1<i,j< fi Aj} U (26 :1<i< 8, 
A = {261,641 — E&i: 1 <i <l-1} 
e Type De (£ > 4): 


V =Q", = {tei tej: 1<ij<LiZ 5}, 
A = {£1 + 62,614 — E&i: br < f= 1} 


e Type Go. 
e < o 
Let e = £1 + E2 + £3 E€ Q3, then: 
V = {v E€ Q? : (v,e) =0},® = {ci — Ej : i j} U {£(3e;—e):1<i<3} 
A = {£1 — £2, € — 3e1} 


e Type Fy: 


e —> e 


V =Q, 


1 
= {+e;:1<i< 4} U {te; ej it FULTS €ytegte3 + €4)} 


1 
A = {€2 — €3,€3 E4, E4, 5 (E1 E2 — E3 — €4)}- 
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e Type En (n = 6,7, 8). 


These can all be constructed inside Eg by taking the span of the appropriate subset of a base, 
so we just give the root system for Eg. 


8 8 
1 
8 pee cae 
V=Q,7,@={te, +e; :i GDA 1) i DBC 22), 


A = {£1 + E2, Ei41 — Ei, 3 (E1 + E8 — (E2 +E3 +... +€7)):1<i <6} 


Note that the Weyl groups of type Be and Cy are equal. The reason for the restriction on £ in the 
types B,C,D is to avoid repetition, e.g. Bz and C% are the same up to relabelling the vertices. 


Remark 20.6. I certainly don’t expect you to remember the root systems of the exceptional types, but 
you should be familiar with the ones for type A, B, C and D. The ones of rank two (i.e. A2, Bz and G2) 
are also worth knowing (because for example you can draw them!) 


21. THE CLASSIFICATION OF SEMISIMPLE LIE ALGEBRAS 


Only the statements of the theorems in this section are examinable, but it is important to know these statements! 

Remarkably, the classification of semisimple Lie algebras is identical to the classification of root 
systems: each semisimple Lie algebra decomposes into a direct sum of simple Lie algebras, and it is 
not hard to show that the root system of a simple Lie algebra is irreducible. Thus to any simple Lie 
algebra we may attach an irreducible root system. 

A first problem with this as a classification strategy is that we don’t know our association of a root 
system to a semisimple Lie algebra is canonical. The difficulty is that, because our procedure for at- 
taching a root system to a semisimple Lie algebra involves a choice of Cartan subalgebra, we don’t 
currently know it is a bijective correspondence — possibly the same Lie algebra has two different Cartan 
subalgebras which lead to different root systems. The theorem which ensures this is not the case is the 
following, where the first part is the more substantial result (though both require some work): 


Theorem 21.1. Let g be a Lie algebra over any algebraically closed field k. 
(1) Let b,b’ be Cartan subalgebras of g. There is an automorphism @: g —> g such that (b) = 6’. 
(2) Let gı, g2 be semisimple Lie algebras with Cartan subalgebras bı, ba respectively, and suppose now k is 
of characteristic zero. Then if the root systems attached to (g1, 41) and (g2, h2) are isomorphic, there is 
an isomorphism $: gı — go taking bı to he. 


Once you know that the assignment of a Dynkin diagram captures a simple Lie algebra up to iso- 
morphism, we still need to show all the root systems we construct arise as the root system of a simple 
Lie algebra. That is exactly the content of the next theorem. 


Theorem 21.2. There exists a simple Lie algebra corresponding to each irreducible root system. 


There are a number of approaches to this existence theorem. A concrete strategy goes as follows: 
one can show that the first four infinite families A, B, C, D correspond to the classical Lie algebras, 
sle41, 502041, SP2e, $02¢, whose root systems can be computed directly (indeed you did a number of 
these calculations in the problem sets). This of course also requires checking that these Lie algebras are 
simple (or at least semisimple) but this is also straight-forward with the theory we have developed. It 
then only remains to construct the five “exceptional” simple Lie algebras. This can be done in a variety 
of ways — given a root system where all the roots are of the same length there is an explicit construction 
of the associated Lie algebra by forming a basis from the Cartan decomposition (and a choice of base 
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of the root system) and explicitly constructing the Lie bracket by giving the structure constants with 
respect to this basis (which, remarkably, can be chosen for the basis vectors corresponding to the root 
subspaces to lie in {0, +1}). This gives in particular a construction of the Lie algebras of type E¢, E7, Es 
(and also A; and D; though we already had a construction of these). The remaining Lie algebras can 
be found by a technique called “folding” which studies automorphisms of simple Lie algebras, and 
realises the Lie algebras G2 and F as fixed-points of an automorphism of D4 and Ee respectively. 

There is also an alternative, more a posteriori approach to the uniqueness result which avoids show- 
ing Cartan subalgebras are all conjugate for a general Lie algebra: one can check that for a classical Lie 
algebra g C gl,, as above, the Cartan subalgebras are all conjugate by an element of Aut(g) (in fact you 
can show the automorphism is induced by conjugating with a matrix in GL,,(k)) using the fact that a 
Cartan subalgebra of a semisimple Lie algebra is abelian and consists of semisimple elements. This 
then shows the assignment of a root system to a classical Lie algebra is unique, so it only remains to 
check the exceptional Lie algebras. But these all have different dimensions, and the dimension of the 
Lie algebra is captured by the root system, so we are done.*! 

We conclude by mentioning another, quite different, approach to the existence result, using the 
Serre’s presentation: just as one can describe a group by generators and relations, one can also describe 
Lie algebras in a similar fashion. If g is a semisimple Lie algebra and A = {a1,...,a¢} is a base of the 
corresponding root system with Cartan matrix C = (a;;) then picking bases for the sl,,-subalgebras 
corresponding to them, it is not too hard to show that g is generated by the set {ea, fa, ha : a € A}. 

The Serre presentation gives an explicit realisation, given an arbitrary root system, of the relations 
which one needs to impose on a set of generators for a Lie algebra labelled {ea, fa, ha : a E ®} as 
above obtain a semisimple Lie algebra whose associated root system is the one we started with. This 
approach has the advantage of giving a uniform approach, though it takes some time to develop the 
required machinery. 


22. APPENDIX 1: BACKGROUND ON SYMMETRIC BILINEAR FORMS 


In this section we review the basics of symmetric bilinear forms over a field k. It is all material that 
is almost in Part A Algebra, but perhaps not quite phrased there the way we use it. We shall work to 
begin with over an arbitrary field k. 


Definition 22.1. Let V be a k-vector space. A function B: V x V — kis said to be bilinear if it is linear 
in each factor, that is if 

B(Aiv2 + A202, w) = MB(wv1, w) + A2B(vo, w), 
and 

Biv, Àw + A2W2) = Ai Biv, w) + A2B(v, we), 
for all v, v1, v2, w, w1, w2 E€ V, Ai, A2 E€ k. We say that it is symmetric if B(v, w) = B(w,v). The radical of 
a symmetric bilinear form is the subspace: 


rad(B) = {v € V : Biv, w) =0,Vw E€ V}. 


We say that B is nondegenerate if rad(B) = {0}. Let Bil(V) denote the vector space of bilinear forms on 
V, and?” SBil(V) for the space of symmetric bilinear forms on V. 


Lemma 22.2. There is a natural isomorphism ©: Bil(V) + Hom(V,V*). 


Proof. Suppose that B € Bil(V). Then define O(B) = 0 € Hom(V,V%) as follows: given v € V, let 
6(v): V > k be the function given by 6(v)(w) = B(v, w). The linearity of B in the second variable then 
ensures that 0(v) € V*, while the linearity of B in the first variable ensures that the map v +> 4(v) is 
linear, so that @ is induce a linear map from V to V*. 

Conversely, given 6: V + V*, define By by Bo(v, w) = 0(v)(w). It is easy to see that By € Bil(V), 
and that 0 +> Bọ gives a linear map Hom(V, V*) — Bil(V) which is clearly inverse to the map 9, so 
they are both isomorphisms. 


Remark 22.3. The lemma shows that giving a bilinear form on V is equivalent to giving a linear map 
from V to V*. Note that we made a choice in the above construction, since given B € Bil(V) we could 
have defined 6 by 0(v)(w) = B(w, v). For symmetric bilinear forms the two possible choices agree, but 
for arbitrary bilinear forms they establish different isomorphisms. 


31 This is completely rigorous, but feels like cheating (to me). 
32This is not standard notation — it would be more normal to write something like Sym?(V*) but then I'd have to explain 
why... 
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From now on we will only work with symmetric bilinear forms. The kernel of 0 = O(B) is clearly 
the subspace of v € V such that 6(v)(w) = 0 for all w € V, which is exactly the definition of rad(B) as 
required. 


Now fix B € SBil(V). Then if U is a subspace of V, we define 
UŁ = {v € V : B(v, w) = 0,Yw € U}. 


If 0: V — V* is the associated map from V to V*, then clearly U+ = 671 (U?). When B is nondegener- 
ate, so that @ is an isomorphism, this shows that dim(U+) = dim(V) — dim(U). The next Lemma shows 
that this can be refined slightly. 


Lemma 22.4. Let V be a finite-dimensional k-vector space equipped with a symmetric bilinear form B. Then for 
any subspace U of V we have the following: 

(1) dim(U) + dim(U+) > dim(V). 

(2) The restriction of B to U is nondegenerate if and only if V = U ẹ U+. 


Proof. For the first part, define a map ¢: U + V* by ¢(u)(v) = B(u, v), (u € U,v € V). Then U+ is by 
definition exactly the subspace of (im(¢))? (ie. the annihilator of im(¢) under the natural identification 
of V with its double dual (V*)*). It follows that dim(U+) + dim(im(¢)) = dim(V), and hence certainly 


dim(U+) + dim(U) > dim(V). 


For the second part, it is immediate from the definitions that B is nondegenerate when restricted to 
U if and only if UNU+ = {0}, ie. if and only if the sum of U and U* is direct. But then the equivalence 
follows immediately from the dimension inequality in the first part. 


22.1. Classification of symmetric bilinear forms. This subsection is not needed for the course? but 
might be clarifying. There is a natual linear action of GL(V) on the space Bil(V): if g € GL(V) and 
B € Bil(V) then we set g(B) to be the bilinear form given by 


g(B)(v,w) = B(g"(v), g (w)), (ww EV), 


where the inverses ensure that the above equation defines a left action. It is clear the action preserves 
the subspace of symmetric bilinear forms. 

Since we can find a invertible map taking any basis of a vector space to any other basis, the next 
lemma says that over an algebraically closed field there is only one nondegenerate symmetric bilinear 
form up to the action of GL(V), that is, when k is algebraically closed the nondegenerate symmetric 
bilinear forms are a single orbit for the action of GL(V). 


Lemma 22.5. Let V be a k-vector space equipped with a nondegenerate symmetric bilinear form B. Then if 
char(k) # 2, there is an orthonormal basis of V , i.e a basis {v1,...,Un} of V such that B(v;, vj) = ôij. 


Proof. We use induction on dim(V). The identity” 


Biv, w) = 5 (Ble +w,v +w) — Biv,v) — Biw,w)), 
shows that if B 4 0 we may find a vector v € V such that B(v,v) 4 0. Rescaling by a choice of square 
root of B(v,v) (which is possible since k is algebraically closed) we may assume that B(v,v) = 1. 
But if L = k.v then since Bj, is nondegenerate, the previous lemma shows that V = L ẹ L~, and if 
B is nondegenerate on V it must also be so on L+. But dim(Z+) = dim(V) — 1, and so L+ has an 
orthonormal basis {v1,...,Un—1. Setting v = vn, it then follows {v1,...,vn} is an orthonormal basis of 
V as required. 


Remark 22.6. Over the real numbers, for example, there is more than one orbit of nondegenerate sym- 
metric bilinear form, but the above proof can be modified to give a classification and it turns out that 
there are dim(V) + 1 orbits. 


3350 in particular you don’t need to know it for any exam... 

34Note that this identity holds unless char(k) = 2. It might be useful to remember this identity when understanding the 
Proposition which is the key to the proof of the Cartan Criterion: it claims that if g = Dg then there is an element x € g with 
k(x, x) £ 0. Noting the above identity, we see this is equivalent to asserting that « is nonzero. 
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23. APPENDIX 2: REPRESENTATION THEORY BACKGROUND 


We recall here some basics of representation theory used in the course, all of which is covered (in 
much more detail than we need) in the Part B course on Representation theory. Let g be a Lie algebra. 
The main body of the notes proves all that is needed in the course, but the material here might help 
clarify some arguments. We will always assume our representations are finite dimensional unless we 
explicitly say otherwise. 


Definition 23.1. A representation is irreducible if it has no proper nonzero subrepresentations. A rep- 
resentation (V, p) is said to be indecomposable if it cannot be written as a direct sum of two proper 
subrepresentations. A representation is said to be completely reducible if is a direct sum of irreducible 
representations. 


Clearly an irreducible representation is indecomposable, but the converse is not in general true. 
For example k? is naturally a representation for the nilpotent Lie algebra of strictly upper triangular 
matrices n2 C gh (k) and it is not hard to see that it has a unique 1-dimensional sub representation, 
hence it is indecomposable, but not irreducible. 

A basic observation about irreducible representations is Schur’s Lemma: 


Lemma 23.2. Let g be a Lie algebra and let (V, p), (W,a) be irreducible representations of g. Then any g- 
homomorphism 6: V —> W is either zero or an isomorphism. Moreover, if k is algebraically closed, then 
Hom,(V,W) is one-dimensional. 


Proof. The proof is exactly the same as the proof for finite groups. If ¢ is nonzero, then ker(@) is a 
proper subrepresentation of V, hence as V is irreducible it must be zero. It follows V is isomorphic to 
(V), which is thus a nonzero subrepresentation of W. But then since W is irreducible we must have 
W = (V) and ¢is an isomorphism as claimed. 

Thus if Hom,(V, W) is nonzero, we may fix some ¢: V — W an isomorphism from V to W. Then 
given any g-homomorphism a: V — W, composing with ¢~' gives a g-homomorphism from V to 
V, thus it is enough to assume W = V. But then if a: V — V is a g-endomorphism of V, since k is 
algebraically closed, it has an eigenvalue À and so ker(a — A) is a nonzero subrepresentation, which 
must therefore be all of V, that is a = \.idy, so that Hom,(V, V) is one-dimensional as claimed. 


Definition 23.3. A representation (V, p) is said to be semisimple if any subrepresentation U has a com- 
plement, that is, there is a subrepresentation W such that V = U @ W. A representation is said to be 
completely reducible if it is a direct sum of irreducible representations. 


Lemma 23.4. If V is a semisimple representation, then any subrepresentation or quotient representation of V is 
semisimple. 


Proof. Supose that q: V — W is a surjective map, and that V is semisimple. We claim that W is 
semisimple. Indeed if W, is a subrepresentation of W, the q~'(W1) = V; is a subrepresentation of V, 
which has a complement V2. Then it follows easily that q(V2) is a complement to W; in W: indeed it is 
clear that W = W, + q(V2) since V = q7'(Vi) @ Va, and if w € q(V2) N Wi, we may write w = q(v) for 
some v € V2, but then v € q~!(W1), hence w € q7!(Vi) O V2 = {0}. 

Next, if U is a subrepresentation of V, then picking a complement U’ to U, so that V = U @U’, the 
corresponding projection map 7: V — U with kernel U’ shows that U is isomorphic to a quotient of V, 
and hence is also semisimple. 


Lemma 23.5. f Let (V, p) be a representation. Then the following are equivalent: 
i) V is semisimple, 
ii) V is completely reducible, 
iii) V is the sum of its irreducible subrepresentations. 


Proof. Once we know that any subrepresentation of a semisimple representation is again semisimple, 
the proof of part ii) of Lemma 15.3 shows that i) implies iz). Certainly ii) implies iii) so it is enough 
to show that iii) implies i). For this, suppose that V is the sum of its irreducible subrepresentations 
and that U is a subrepresentation of V. Let W be a subrepresentation of V which is maximal (with 
respect to containment) subject to the condition that U N W = {0}. We claim that V = U ẹ W. To 
see this, suppose that U p W # V. Then by our assumption on V there must be some irreducible 
subrepresentation X with X not contained in W @ U, and hence X N (W GU) = {0}. But then we 
certainly have% (X © W) N U = {0}, which contradicts the maximality of W, so we are done. 


since both are just expressing the fact that the sum X + W + U is direct. 
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Lemma 23.6. Let g be a Lie algebra. A representation is semisimple if and only if every surjective map of 
representations q: V — W has a right inverse, that is there is a map of representations s: W — V such that 
qos= idw. 


Proof. Suppose that V is semisimple. Then given a surjection q: V + W, we may find a complement 
U to the subrepresentation ker(q). Clearly then qy is an isomorphism from U to W, and its inverse 
gives the required homomorphism s. For the converse, if U is a subrepresentation, then the quotient 
map q: V > V/U has a right inverse s. We claim that V = U @ s(V/U). Suppose that u € V lies in 
the intersection. Then u = s(w) for some w € V/U, and hence (since U is the kernel of q) we have 
0 = q(u) = g(s(w) = w, so that u = 0. Thus U N s(V/U) = {0}, and thus by dimension counting we see 
that V =U @ s(V/U) and s(V/U) is a complement as required. 


Remark 23.7. Our proof of Weyl’s theorem showed any surjective map of representations splits, hence 
any representation is semisimple, and hence completely reducible. Thus it only uses the chain of im- 
plications: 

surjections split —> semisimple — > completely reducible, 


while above we have shown that in fact all three statements are equivalent. 


As we saw in our study of representations of nilpotent Lie algebras, it is not always the case that 
a representation is a direct sum of irreducible representations, i.e. a representation is not always com- 
pletely reducible, and for these representations the notion of a composition series is useful: It shows 
that, even if a representation is not completely reducible, we may still think of it as being built out of 
irreducibles. Indeed using induction on dimension, it is easy to see that any representation may be 
filtered by subrepresentations 0 = Vo Ç Vi € ... © Vk = V where each successive quotient V;/V;—1ı 
is irreducible. We call such a filtration a composition series for V, and the irreducible representations 
V;/V;-1 (1 < i < k) are called composition factors. 

Although we don’t use it in the course, the following result is shows that the composition factors are 
actually independent of the filtration: If L is a simple representation then write [L, V] for the number 
of times L occurs in the composition series (i.e. [L, V] = #{i: 1 <i<k,Vi/Vi-i S L}. 


Lemma 23.8. (Jordan-Holder). The numbers |L, V] are independent of the composition series. 


Proof. Use induction on the dimension of V. Clearly if this is 1, then V is irreducible and the result 
is clear. Now suppose that (V;)*_,) and (W;)}_, are two composition series for V. There is a smallest 
j such that W; N V; is nonzero, and then since V; is irreducible we must have V; N W; = Vj, that is, 
Vı C W,. But then the induced map V; —> W,;/W;_1 must be an isomorphism by Schur’s Lemma, so 
that W; = W;-1 $ Vi. and so setting 


ag (W: @ Va) / Va ifi < j, 
"a= Wi+1/V1 ifj <l-1. 
we obtain a composition series of V/V|, whose composition factors are those of the composition series 


(W;){_, for V, with one fewer copy of the isomorphism class of V; & W; /W;—1. By induction it has the 
same composition factors as the filtration {V;/V, : 1 < i < k}, and we are done. 


Remark 23.9. If we take g = gl (k), then a representation (V, p) of g is completely determined by the 
linear map a = p(1). Subrepresentations of V correspond to subspaces which are invariant for the 
linear map a, and the existence of eigenvalues shows that the irreducible (finite dimensional) repres- 
entations of gl,(k) are one-dimensional and hence completely reducible representations are the ones 
for which a is diagonalisable (or in the terminology favoured in this course, “semisimple”!) Notice 
that Lemma 23.4 then yields that fact shows that if a linear map is diagonalisable, it is diagonalisable 
on any invariant subspace. 


